Salt tolerant organisms

ABSTRACT

Disclosed herein are transformed non-vascular photosynthetic organisms that are salt tolerant, nucleotides and vectors useful in conducting such transformations, and transformed strains produced by such transformations.

CROSS REFERENCE TO RELATED APPLICATION

This application is a national phase under 35 U.S.C. §371 ofInternational Application Number PCT/US2010/048991, filed Sep. 15, 2010,which claims the benefit of U.S. Provisional Application No. 61/242,574,filed Sep. 15, 2009, and also claims the benefit of U.S. ProvisionalApplication No. 61/301,735, filed Feb. 5, 2010, the entire contents ofboth provisional applications are incorporated by reference for allpurposes.

INCORPORATION OF SEQUENCE LISTING

This application contains a Sequence Listing which has been submitted inASCII format via EFS-Web and is hereby incorporated by reference in itsentirety. Said ASCII copy, created. on Jul. 8, 2013, is named0742US_UTL2_ST25.txt and is 897,057 bytes in size.

INCORPORATION BY REFERENCE

All publications, patents, and patent applications mentioned in thisspecification are herein incorporated by reference to the same extent asif each individual publication, parent, or patent application wasspecifically and individually indicated to be incorporated by reference.

BACKGROUND

Algae are highly adaptable plants that are capable of rapid growth undera wide range of conditions. As photosynthetic organisms, they have thecapacity to transform sunlight into energy that can be used tosynthesize a variety of biomolecules for use as industrial enzymes,therapeutic compounds and proteins, nutritional, commercial, or fuelproducts, etc.

The majority of algal species are adapted to growth in an aqueousenvironment, and are easily grown in liquid media using light as anenergy source. The ability to grow algae on a large scale in an outdoorsetting, in ponds or other open or closed containers, using sunlight forphotosynthesis, enhances their utility for bioproduction, environmentalremediation, and carbon fixation.

SUMMARY

Provided herein is a non-vascular photosynthetic organism comprising atleast one mutation in any one of SEQ ID NO. 115, SEQ ID NO. 116, SEQ IDNO. 122, SEQ ID NO. 201, SEQ ID NO. 129, SEQ ID NO. 202, SEQ ID NO. 206.SEQ ID NO. 133, SEQ ID NO. 198, SEQ ID NO. 200, SEQ ID NO. 204, SEQ IDNO. 203, SEQ ID NO. 135, SEQ ID NO. 134, SEQ ID NO. 166, SEQ ID NO. 162,SEQ ID NO. 169, SEQ ID NO. 168, SEQ ID NO. 170, SEQ ID NO. 175, SEQ IDNO. 127, SEQ ID NO. 230, SEQ ID NO. 231, SEQ ID NO. 233 and SEQ ID NO.234 or a sequence having at least 95% sequence identity to any of thepreceding sequences, wherein the at least one mutation comprises one ormore nucleotide additions, deletions and/or substitutions and theorganism has an increased growth rate in an aqueous environmentcontaining between about 75 mM and 275 mM sodium chloride as compared tothe same organism without the at least one mutation.

The at least one mutation can be in a coding region where it may resultin one or more amino acid additions, deletions and/or substitutions. Theone or more mutations can also be in regulatory regions such as a 5′ UTRregion or a 3′ UTR region. In one embodiment the at least one mutationis located in a promoter region.

In one embodiment, the activity of a protein encoded by any one of SEQID NO. 115, SEQ ID NO. 116, SEQ ID NO. 122, SEQ ID NO. 201, SEQ ID NO.129, SEQ ID NO. 202, SEQ ID NO. 206, SEQ ID NO. 133, SEQ ID NO. 198, SEQID NO. 200, SEQ ID NO. 204, SEQ ID NO. 203, SEQ ID NO. 135, SEQ ID NO.134, SEQ ID NO. 166, SEQ ID NO. 162, SEQ ID NO. 169, SEQ ID NO. 168, SEQID NO. 170, SEQ ID NO. 175, SEQ ID NO. 127, SEQ ID NO. 230, SEQ ID NO.231, SEQ ID NO. 233 and SEQ ID NO. 234 or a protein having at least 95%amino acid sequence identity to a protein encoded by any of thepreceding sequences is decreased by the presence of the at least onemutation as compared to the protein without the at least one mutation.The activity of the protein may be decreased by at least 10%, at least20%, at least 30%, at least 40%, at least 50%, at least 60%, at least70%, at least 80%, at least 90% or 100% (i.e. inactive).

In other embodiments, the organism with the at least one mutation has agrowth rate that is at least 10%, at least 20%, at least 30%, at least40%, at least 50%, at least 60%, at least 70%, at least 80%, at least90%, at least 100%, at least 125%, at least 150%, at least 175%, atleast 200%, at least 225%, at least 250%, at least 275%, at least 300%,at least 325%, at least 350%, at least 375%, at least 400%, at least425%, at least 450%, at least 475% or at least 500% greater than theorganism without the at least one mutation.

In further embodiments, the presence of the at least one mutationresults in a transcription rate of any of the preceding nucleotidesequences that is decreased by at least 10%, at least 20%, at least 30%,at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, atleast 90% or 100% (no detectable transcripts) as compared totranscription in the same organism without the at least one mutation. Inother embodiments, the presence of the at least one mutation results ina decrease in the translation of a protein encoded by any of thepreceding nucleotide sequences by at least 10%, at least 20%, at least30%, at least 40%, at least 50%, at least 60%, at least 70%, at least80%, at least 90% or 100% (no detectable translation) as compared totranslation in the same organism without the at least one mutation.

Another embodiment provides at genetically modified non-vascularphotosynthetic organism comprising at least one RNAi agent comprising anantisense nucleotide sequence that is complementary to mRNA transcribedfrom any one of SEQ ID NO. 115, SEQ ID NO. 116, SEQ ID NO. 122, SEQ IDNO. 201, SEQ ID NO. 129, SEQ ID NO. 202, SEQ ID NO. 206, SEQ ID NO. 133,SEQ ID NO. 198, SEQ ID NO. 200, SEQ ID NO. 204, SEQ ID NO. 203, SEQ IDNO. 135, SEQ ID NO. 134, SEQ ID NO. 166, SEQ ID NO. 162, SEQ ID NO. 169,SEQ ID NO. 168, SEQ ID NO. 170, SEQ ID NO. 175, SEQ ID NO. 27, SEQ IDNO. 230, SEQ ID NO. 231, SEQ ID NO. 233 and SEQ ID NO. 234 or a sequencehaving at least 95% sequence identity to any of the preceding sequencesand in which the organism has an increased growth rate in an aqueousenvironment containing between about 75 mM and 275 mM sodium chloride ascompared to the organism not modified with the at least one RNAi agent.In certain embodiments, the at least one RNAi agent is a microRNA(miRNA) or a small interfering RNA (siRNA).

In one embodiment the activity of a protein encoded by any one of SEQ IDNO. 115, SEQ ID NO. 116, SEQ ID NO. 122, SEQ ID NO. 201, SEQ ID NO. 129,SEQ ID NO. 202, SEQ ID NO. 206, SEQ ID NO. 133, SEQ ID NO. 198, SEQ IDNO. 200, SEQ ID NO. 204, SEQ ID NO. 203, SEQ ID NO. 135, SEQ ID NO. 134,SEQ ID NO. 166, SEQ ID NO. 162, SEQ ID NO. 169, SEQ ID NO. 168, SEQ IDNO. 170, SEQ ID NO. 175, SEQ ID NO. 127, SEQ ID NO. 230, SEQ ID NO. 231,SEQ ID NO. 233 and SEQ ID NO. 234 or a protein having at least 95% aminoacid sequence identity to a protein encoded by any one of the precedingsequences is decreased as compared to the protein in the same organismwhich is not modified with the at least one RNAi agent. In certainembodiments, the activity of the protein is decreased by at least 10%,at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, atleast 70%, at least 80%, at least 90% or 100% (i.e. inactive).

In additional embodiments, the growth rate of the organism is at least10%, at least 20%, at least 30%, at least 40%, at least 50%, at least60%, at least 70%, at least 80%, at least 90%, at least 100%, at least125%, at least 150%, at least 175%, at least 200%, at least 225%, atleast 250%, at least 275%, at least 300%, at least 325%, at least 350%,at least 375%, at least 400%, at least 425%, at least 450%, at least475% or at least 500% greater than the same organism not modified withthe at least one RNAi agent.

In further embodiments, the presence of full length transcripts of anyof the preceding nucleotide sequences is decreased by at least 10%, atleast 20%, at least 30%, at least 40%, at least 50%, at least 60%, atleast 70%, at least 80%, at least 90% or 100% (no detectable full lengthtranscripts) as compared to the same organism not modified with the atleast one RNAi agent. In other embodiments, the presence of a proteinencoded by any of the preceding sequences or a protein having at least95% amino acid sequence identity to a protein encoded by any of thepreceding nucleotide sequences is decreased by at least 10%, at least20%, at least 30%, at least 40%, at least 50%, at least 60%, at least70%, at least 80%, at least 90% or 100% (no detectable protein) ascompared to the same organism not modified with the at least one RNAiagent.

In any of the above embodiments the non-vascular photosynthetic organismmay be a cyanobacterium or an alga. The alga can be a microalga or amacroalga. Non-limiting examples of microalgal species includeChlamydomonas sp, Volvacales sp, Dunaliella sp, Scenedesmus sp, Chorellasp, Hematococcus sp., Volvox sp, or Nannochloropsis sp. Particularexamples of microalgae include, but are not limited to, C. reinhardtii,N. oceanica, N. salina, D. salina, H. pluvalis, S. dimorphus, D.viridis, N. salina, N. oculata or D. tertiolecta.

In any of the preceding embodiments, the concentration of sodiumchloride in the aqueous environment is between about 250 mM and 100 mM,between about 200 mM and 100 mM, between about 175 mM and 75 mM, betweenabout 150 mM and 75 mM, between about 275 mM and 100 mM, between about275 mM and 150 mM or between about 275 mM and 200 mM.

Presented herein are non-vascular photosynthetic organisms, for example,algae that are engineered to be salt tolerance.

A salt tolerant alga as disclosed herein is transformed “knocking down”or “knocking out” one or more polynucleotides that encode one or moreproteins (see “Nucleic Acid and Amino Acid Sequences”). Algae thatinclude one or more knock out or knock down genes that confer salttolerance can be grown in concentrations of salt that can deter thegrowth of other algae and, in some embodiments, other non-algalorganisms. Also provided are algae transformed with a polynucleotidethat encodes a protein that is toxic to one or more animal species, suchas a gene encoding a Bt toxin that is lethal to insects.

Algae with one or more polynucleotides are knocked out or knocked downto prove resistance to salt are in some embodiments grown on a largescale in the presence of high concentrations of salt for the productionof biomolecules, such as, for example, therapeutic proteins, industrialenzymes, nutritional molecules, commercial products, or fuel products.Algae transformed with one or more toxin genes that are lethal to one ormore insect species can also be grown in large scale for production oftherapeutic, nutritional, fuel, or commercial products. Algaebioengineered for salt tolerance and/or to express insect toxins canalso be grown in large scale cultures for decontamination of compounds,environmental remediation, or carbon fixation.

Provided in some embodiments herein is a salt tolerant prokaryotic algatransformed to knock out or knock down a polynucleotide encoding aprotein that confers salt sensitivity. In some embodiments, the alga isa cyanobacteria species.

In some embodiments, the host alga transformed is a eukaryotic alga. Insome embodiments, the host alga is a species of the Chlorophyta. In someembodiments, the alga is a microalga. In some instances, the microalgais a Chlamydomonas species. A transformed alga having salt tolerance byinactivation of a gene in the chloroplast genome is in some embodimentshomoplastic for the inactivation.

In another embodiment, provided herein is a salt tolerantnon-chlorophyll c-containing eukaryotic alga, comprising an exogenouspolynucleotide integrated into the nuclear genome, wherein the exogenouspolynucleotide comprises a sequence that encodes a protein that confersresistance to an herbicide, wherein resistance to the herbicide isconferred by a single exogenous protein.

In another embodiment, provided herein is a non-chlorophyll c-containingeukaryotic alga, comprising a knockout or a knockdown of apolynucleotide in the nuclear genome, wherein the absence or inhibitionof the product of the polynucleotide confers resistance to salt.

Also provided herein is a salt tolerant non-chlorophyll c-containingeukaryotic alga, comprising a recombinant polynucleotide integrated intothe nuclear genome, in which the recombinant polynucleotide encodes anendogenous or exogenous EPSPS protein that confers resistance toglyphosate.

Also provided are nucleic acid constructs for transforming algae withone or more nucleotide sequences that knock out or knock down genes inorder to confer salt tolerance.

The disclosure further provides an alga comprising a recombinantpolynucleotide that encodes a Bacillus thuringiensis (Bt) toxin protein.In one embodiment, the alga includes a cry gene encoding the Bt toxin.The exogenous Bt toxin gene can be incorporated in to the nuclear genomeor the chloroplast genome of the alga. Introduction of a exogenous Bttoxin gene can interrupt and inactive nucleotides that encode a proteinsconferring sensitivity to salt, thus making the alga salt tolerance.

The disclosure further provides a salt tolerant eukaryotic algacomprising one or more recombinant polynucleotide sequences encodingproteins that confer resistance to herbicides, in which each of theproteins confers resistance to a different herbicide. In one embodiment,at least one of the polynucleotide sequences encoding a proteinconferring herbicide resistance is integrated into the chloroplastgenome of a eukaryotic alga. In one embodiment, at least one of thepolynucleotide sequences encoding a protein conferring herbicideresistance is integrated into the nuclear genome of a eukaryotic alga.In a further embodiment, at least a first of the two or morepolynucleotide sequences encoding a protein conferring herbicideresistance is integrated into the chloroplast genome and at least asecond of the two or more polynucleotide sequences encoding a proteinconferring herbicide resistance is integrated into the nuclear genome ofa eukaryotic alga.

Also provided herein is a non chlorophyll c-containing salt tolerantalga comprising a polynucleotide encoding a protein that confersresistance to an herbicide and an exogenous polynucleotide encoding aprotein that does not confer resistance to an herbicide, wherein theprotein that does not confer resistance to a herbicide is an industrialenzyme or therapeutic protein, or a protein that participates in orpromotes the synthesis of at least one nutritional, therapeutic,commercial, or fuel product, or a protein that facilitates the isolationof at least one nutritional, therapeutic, commercial, or fuel product.

Also disclosed herein are methods of producing one or more biomolecules,in which the methods include transforming an alga by knocking out orknocking down one or more polynucleotides resulting in salt tolerance,growing the alga in the presence of high salt concentrations, andharvesting one or more biomolecules from the alga or algal media. Themethods in some embodiments include isolating the one or morebiomolecules.

In some embodiments, algae are further transformed with at least oneherbicide resistance gene and at least one toxin gene, and are grown inthe presence of at least one herbicide under conditions in which thetoxin is expressed, and one or more biomolecules is harvested from thealga or algal media.

Also disclosed herein are methods of producing a biomass-degradingenzyme in an alga, in which the methods include: 1) transforming thealga by knocking out or knocking down one or more polynucleotides and soconferring salt tolerance to the alga, and a sequence encoding anexogenous biomass-degrading enzyme which promotes increased expressionof an endogenous biomass-degrading enzyme; and 2) growing the alga inthe presence of high salt concentrations and under conditions whichallow for production of the biomass-degrading enzyme, in which the saltis in sufficient concentration to inhibit growth of the alga, which doesnot include the knock out or knock down conferring salt tolerance, toproducing the biomass-degrading enzyme. The methods in some embodimentsinclude isolating the biomass-degrading enzyme.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other features, aspects, and advantages of the presentdisclosure will become better understood with regard to the followingdescription, appended claims and accompanying figures where:

FIG. 1 shows an exemplary vector, SENuc391 used in the transformation ofthe nuclear genome of Chlamydomonas reinhardtii to express an artificialmiRNA. The hygromycin resistance gene is indicated by “Aph 7”. It ispreceded by the C. reinhardtii Beta2-tubulin promoter and followed bythe C. reinhardtii rbcS2 terminator. The first intron from the C.reinhardtii rbcS2 gene was inserted within Aph 7″ to increase expressionlevels and consequentially, the number of transformants. The paromomycinresistance gene is indicated by “Aph VIII”. It is preceded by the C.reinhardtii psaD promoter and followed by the C. reinhardtii psaDterminator. The segment labeled “Hybrid Promoter” which consists of afused promoter beginning with the C. reinhardtii Hsp70A promoter, C.reinhardtii rbcS2 promoter, and the first intron from the C. reinhardtiirbcS2 gene drives the expression of the cre-MIR1157 precursor scaffold.The precursor scaffold is followed by the terminator from the C.reinhardtii rbcS2 gene.

FIG. 2 shows the secondary structure of the miRNA precursor cre-MIR1157found in Chlamydomonas reinhardtii. The label “RE site” indicates therestriction site used to ligate artificial miRNAs.

FIG. 3 shows a representative miRNA*-loop-miRNA fragment and the BglIIrestriction site used to ligate into SENuc391.

FIG. 4 shows an expression cassette containing the coding sequence forboth the zeocin resistance gene (ble) and the xylanase gene (BD12)linked by the Foot-and-mouth disease virus peptide 2A. The 2A sequenceresults in a single mRNA transcript, but two polypeptides. RNAinterference of the BD12 transcript will result in both a decrease ofBD12 protein, BD12 activity, and zeocin resistance.

FIG. 5 shows analysis of 12 transformants containing the BD12 silencingcassette followed by a wildtype control labeled “21gr” and aBD12-containing strain without the BD12 cassette. A BD12 gene screencontrol (row A); a western blot (row B); sensitivity to solid TAPmedia+10 μg/mL zeocin (row C); and sensitivity to solid TAP media+40μg/mL zeocin (row D) were performed to demonstrate the variance ofknockdown as a product of individual transformation events. As BD12expression is silenced, BD12 protein levels decrease along with anincrease to zeocin sensitivity.

FIG. 6 shows analysis of lysates and cDNA preps of 12 transformantscontaining the BD12 silencing cassette followed by a wildtype controllabeled “21gr” and a BD12-containing strain without the BD12 silencingcassette. The left-hand y axis is transcript level normalized to thecontrol labeled “BD12+”; the right-hand y axis is xylanase activity(units/s); the x axis represents each of the 12 transformants includingpositive and negative controls. The bars represent the BD12 relativetranscript abundance as determined by quantitative PCR; and the solidline represents xylanase activity. As BD12 expression is silenced, BD12transcript levels decrease along with a decrease in xylanases activity.

FIG. 7 shows the cre-MIR1157 nucleotide sequence (SEQ ID NO: 247) thatwas amplified from Chlamydomonas reinhardtii CC-1690 (mt+) genomic DNAvia PCR. The location of the endogenous miRNA*-loop-miRNA sequences areindicated by “boxes.”

FIG. 8 shows an exemplary vector, SENuc 146 used in the transformationof the nuclear genome of Chlamydomonas reinhardtii to generate the genedisruption library. The hygromycin resistance gene is indicated by “Aph7”. It is preceded by the C. reinhardtii Beta2-tubulin promoter andfollowed by the C. reinhardtii rbcS2 terminator. The first intron fromthe C. reinhardtii rbcS2 gene is inserted within Aph 7″ to increaseexpression levels and consequentially, the number of transformants.Following the rbcS2 terminator is the segment labeled “Hybrid Promoter”which consists of a fused promoter beginning with the C. reinhardtiiHsp70A promoter, C. reinhardtii rbcS2 promoter, and the first intronfrom the C. reinhardtii rbcS2 gene.

FIG. 9 shows an exemplary vector, SENuc 140 used in the transformationof the nuclear genome of Chlamydomonas reinhardtii to generate the genedisruption library. The paromomycin resistance gene is indicated by “AphVIII”. It is preceded by the C. reinhardtii psaD promoter and followedby the C. reinhardtii psaD terminator. Following the psaD terminator isthe segment labeled “Hybrid Promoter” which consists of a fused promoterbeginning with the C. reinhardtii Hsp70A promoter, C. reinhardtii rbcS2promoter, and the first intron from the C. reinhardtii rbcS2 gene.

FIG. 10 shows S7 knockdown clones. The y axis is relative transcriptabundance of the S7 gene and the x axis represents 5 individual clones(S7-1, S7-2, S7-3, S7-4, and S7-5), wildtype C. reinhardtii (21gr), andthe S7 gene disruption strain (S7 KO). Also, salt tolerant strainsisolated had reduced transcript levels.

FIG. 11 shows S16 knockdown clones. The y axis is relative transcriptabundance of the S16 gene and the x axis represents 5 individual clones(S16-1, S16-2, S16-3, S16-4, and S16-5), wildtype C. reinhardtii (21gr),and the S16 gene disruption strain (S16 KO). Also, salt tolerant strainsisolated had reduced transcript levels.

FIG. 12 shows two artificial miRNA transformations targeting S7 (rows 2and 3) on a gradient plate from 0 mM to 200 mM sodium chloride. WildtypeC. reinhardtii (21gr) was plated on the top row. Transformants of theartificial miRNA targeting S7 was characterized as having increased salttolerance.

FIG. 13 shows two artificial miRNA transformations targeting S7 (rows 2and 3) on a gradient plate from 0 mM to 200 mM sodium chloride. WildtypeC. reinhardtii (21gr) was plated on the top row. The original S7 genedisruption strain (acquired through the generation of the library) andlabeled “SH7 (S7) knockout” was plated on the bottom row. Transformantsof the artificial miRNA targeting S7 and the originally S7 genedisruption strain was characterized as having increased salt tolerance.

FIG. 14 shows two artificial miRNA transformations targeting S16 (rows 2and 3) on a gradient plate from 0 mM to 200 mM sodium chloride. WildtypeC. reinhardtii (21gr) was plated on the top row. Transformants of theartificial miRNA targeting S16 was characterized as having increasedsalt tolerance.

FIG. 15 shows 42 knockdown colonies for S65 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S65” refersto the Plate ID #S65, strain number S65, and Augustus v.5 Protein ID:517886. See Table 1.

FIG. 16 shows 42 knockdown colonies for S1659 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S59”refers to the Plate ID #S59, strain number S1659, and Protein ID:178706. See Table 1.

FIG. 17 shows 42 knockdown colonies for S77 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S77” refersto the Plate ID #S77, strain number S77, and Augustus v.5 Protein ID:522165. See Table 1.

FIG. 18 shows 42 knockdown colonies for S1666 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S66”refers to the Plate ID #S66, strain number S1666, and Augustus v.5Protein ID: 514721. See Table 1.

FIG. 19 shows 42 knockdown colonies for S1704 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S109”refers to the Plate ID #S109, strain number S1704, and Protein ID:77062. See Table 1.

FIG. 20 shows 42 knockdown colonies for S105 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S105”refers to the Plate ID #S105, strain number S105, and Augustus v.5Protein ID: 524679. See Table 1.

FIG. 21 shows 42 knockdown colonies for S1612 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S12”refers to the Plate ID #S12, strain number S1612, and Protein ID:103075. See Table 1.

FIG. 22 shows 42 knockdown colonies for S1644 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S44”refers to the Plate ID #544, strain number S1644, and Protein ID:331285. See Table 1.

FIG. 23 shows 42 knockdown colonies for S1693 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S120”refers to the Plate ID #5120, strain number S1693, and Protein ID:188114. See Table 1.

FIG. 24 shows 42 knockdown colonies for S1687 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S114”refers to the Plate ID #S114, strain number S1687, and Protein ID:291633. See Table 1.

FIG. 25 shows 42 knockdown colonies for S129 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S129”refers to the Plate ID #S129, strain number S129, and Augustus v.5Protein ID: 510051. See Table 1.

FIG. 26 shows 42 knockdown colonies for S123 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S123”refers to the Plate ID #S123, strain number S123, and Augustus v.5Protein ID: 519822. See Table 1.

FIG. 27 shows 42 knockdown colonies for S289 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S289”refers to the Plate ID #S289, strain number S289, and Augustus v.5Protein ID: 518128. See Table 1.

FIG. 28 shows 42 knockdown colonies for S276 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S276”refers to the Plate ID #S276, strain number S276, and Augustus v.5Protein ID: 512487. See Table 1.

FIG. 29 shows 42 knockdown colonies for S292 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S292”refers to the Plate ID #S292, strain number S292, and Augustus v.5Protein ID: 524030. See Table 1.

FIG. 30 shows 42 knockdown colonies for S291 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S291”refers to the Plate ID #S291, strain number S291, and Augustus v.5Protein ID: 516191. See Table 1.

FIG. 31 shows 42 knockdown colonies for S294 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S294”refers to the Plate ID #S294, strain number S294, and Augustus v.5Protein ID: 522637. See Table 1.

FIG. 32 shows 42 knockdown colonies for S338 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “S338”refers to the Plate ID #S338, strain number S338, and Augustus v.5Protein ID: 512361. See Table 1.

FIG. 33 shows 42 knockdown colonies for S74 with controls at 75 mM, 100mM, 125 mM sodium chloride (left to right). The image label “574” refersto the Plate ID #S74, strain number S74, and Augustus v.5 Protein ID:520845. See Table 1.

FIG. 34 shows 42 knockdown colonies for S1613 with controls at 75 mM,100 mM, 125 mM sodium chloride (left to right). The image label “S1613”refers to the Plate ID #S1613, strain number S1613, and Augustus v.5Protein ID: 174261. See Table 1.

FIG. 35 shows 42 colonies for S1621 with controls at 75 mM, 100 mM, and125 mM sodium chloride (left to right). The image label “S1621” refersto the Plate ID #S1621, strain number S1621, and Protein ID: 206559. SeeTable 1.

FIG. 36 shows 42 colonies for S1623 with controls at 75 mM, 100 mM, and125 mM sodium chloride (left to right). The image label “S1623” refersto the Plate ID #S1623, strain number S1623, and Protein ID: 116145. SeeTable 1.

FIG. 37 shows 42 colonies for S1638 with controls at 75 mM, 100 mM, and125 mM sodium chloride (left to right). The image label “S1638” refersto the Plate ID #31638, strain number S1638, and Protein ID: 418706. SeeTable 1.

FIG. 38 shows 42 colonies for S1655 with controls at 75 mM, 100 mM, and125 mM sodium chloride (left to right). The image label “S1655” refersto the Plate ID #S1655, strain number S1655, and Augustus v.5 ProteinID: 525078. See Table 1.

FIG. 39 shows S77 knockdown clones. The y axis is relative transcriptabundance of the S77 gene and the x axis represents 6 individual clones(S77-1, S77-2, S77-3, S77-4, S77-5, and S77-6), wildtype C. reinhardtii(21gr), and the S77 gene disruption strain (S77 KO). The lower half ofthe figure shows the sensitivity to NaCl of the 6 individual knockdownclones, wild type C. reinhardtii (21gr), and the S77 gene disruptionstrain (left to right respectively). Decreased levels of transcript(strains S77-1, S77-2, and S77-3) correspond to increased NaClresistance. Higher levels of transcript (strains S77-4, S77-5, andS77-6) correspond to increased NaCl sensitivity. Top row is 75 mM NaCl,middle row is 100 mM NaCl, and bottom row is 125 mM NaCl.

FIG. 40 shows S338 knockdown clones. The y axis is relative transcriptabundance of the S338 gene and the x axis represents 6 individual clones(S338-1, S338-2, S338-3, S338-4, S338-5, and S338-6), wildtype C.reinhardtii (21 gr), and the S338 gene disruption strain (S338 KO). Thelower half of the figure shows the sensitivity to NaCl of the 6individual knockdown clones, wild type C. reinhardtii (21gr), and theS338 gene disruption strain (left to right respectively). Decreasedlevels of transcript (strains S338-1, S338-2, S338-3, S338-5, andS338-6) correspond to increased NaCl resistance. Top row is 75 mM NaCl,middle row is 100 mM NaCl, and bottom row is 125 mM NaCl.

FIG. 41 shows the segregation analysis results for strain S7 of 5strains resistant to hygromycin and 5 strains sensitive to hygromycin.The 5 strains resistant to hygromycin are also tolerant to liquid G₀media+75 mM NaCl whereas the 5 strains sensitive to hygromycin do notgrow in liquid G₀ media+75 mM NaCl. These results show that thephenotype (salt tolerance) is genetically linked to the antibioticselection marker or gene disruption.

DETAILED DESCRIPTION

The following detailed description is provided to aid those skilled inthe art in practicing the present disclosure. Even so, this detaileddescription should not be construed to unduly limit the presentdisclosure as modifications and variations in the embodiments discussedherein can be made by those of ordinary skill in the art withoutdeparting from the spirit or scope of the present disclosure.

As used in this specification and the appended claims, the singularforms “a”, “an” and “the” include plural reference unless the contextclearly dictates otherwise.

Endogenous

An endogenous nucleic acid, nucleotide, polypeptide, or protein asdescribed herein is defined in relationship to the host organism. Anendogenous nucleic acid, nucleotide, polypeptide, or protein is one thatnaturally occurs in the host organism.

Exogenous

An exogenous nucleic acid, nucleotide, polypeptide, or protein asdescribed herein is defined in relationship to the host organism. Anexogenous nucleic acid, nucleotide, polypeptide, or protein is one thatdoes not naturally occur in the host organism or is a different locationin the host organism.

Salt Tolerant

The relative growth of a non-vascular photosynthetic organism in thepresence of salinity is termed its salt tolerance. Salt tolerance is theability of a modified non-vascular photosynthetic organism to display animproved response to an increase in extracellular and/or intracellularconcentration of salt including, but not limited to, Na+, Li+ and K+, ascompared to an unmodified organism. Increased salt tolerance may bemanifested by phenotypic characteristics including, for example, longerlife span, increased growth rate, increase productivity, apparent normalgrowth and function of the plant, and/or a decreased level of necrosis,when subjected to an increase in salt concentration, as compared to anunmodified organism.

Salt tolerance can be measured by methods known to one of skill in theart, for example, methods as described in Inan et al. (July 2004) PlantPhysiol. 135:1718, including without limitation, NaCl shock exposure orgradual increase of NaCl concentration.

Knockdown

Transcript levels are considered knocked down when an exogenous nucleicacid is transformed into a host organism to produce a RNA molecule (e.g.miRNA, siRNA) that results in RNA interference/silencing.

Knockout

A gene is considered knocked out when an exogenous nucleic acid istransformed into a host organism (e.g. by random insert-ion orhomologous recombination) resulting in the disruption (e.g. by deletion,insertion) of the gene.

Nucleic Acid and Amino Acid Sequences

Sequence locations designations described below are fromhttp://genome.jgi-psf.org/Chlre4/Chlre4.home.html (Merchant et al.,Science, 318:245-250 (2007)).

SEQ ID NO: 1 Chromosome_6:1409923-1420881-523016

SEQ ID NO: 2 Chromosome_12:1410423-1416267-512725 Protein kinase, core

SEQ ID NO: 3 Chromosome_2:7302492-7305976-519629-Armadillo-type fold

SEQ ID NO: 4 Scaffold_23:64157-70065-520112

SEQ ID NO: 5 Chromosome_2:7871590-7907018-519707-Phosphatidylinositol 3-and 4-kinase, catalytic

SEQ ID NO: 6 Chromosome_1:5106386-5121015-511373 and511374-Calcium-binding EF-hand and Protein kinase, core

SEQ ID NO: 7 511374

SEQ ID NO: 8 Chromosome_17:5617386-5620810-517886 and517887-Leucine-rich repeat

SEQ ID NO: 9 517887

SEQ ID NO: 10 Chromosome_6:6885194-6896388-524003

SEQ ID NO: 11 Chromosome_14:1451054-1454952-515336 and515337-Calcium-binding EF-hand SEQ ID NO: 12 515337

SEQ ID NO: 13 74-chromosome_3:2741639-2747917-520845 and 520844-Proteinkinase, core

SEQ ID NO: 14 520844

SEQ ID NO: 15 Chromosome_4:2134368-2137486-522165-Longin-like

SEQ ID NO: 16 Chromosome_10:3147531-3149275-510079-Zinc finger,RING-type

SEQ ID NO: 17 Chromosome_7:5889191-5892195-525104

SEQ ID NO: 18 Chromosome_16:1660154-1676212-516251

SEQ ID NO: 19 Chromosome_7:3216450-3222618-524679

SEQ ID NO: 20 Chromosome_2:8790481-8811695-519822-NSF attachment protein

SEQ ID NO: 21 Chromosome_10:2994349-2997683-510051

SEQ ID NO: 22 Chromosome_10:5445553-5448591-510417

SEQ ID NO: 23 Chromosome_3:1215915-1220466-Gene catalog-175772

SEQ ID NO: 24 Chromosome_10:1283278-1284432-509766 and509765-Profilin/allergen

SEQ ID NO: 25 509765

SEQ ID NO: 26 Scaffold_22:53087-55615-520043

SEQ ID NO: 27 Chromosome_9:1628082-1634270-526026-Major facilitatorsuperfamily

SEQ ID NO: 28 Chromosome_2:2611537-2615015-518848 and518847-Thioredoxin-like and Alkyl hydroperoxide reductase/Thiol specificantioxidant/Mal allergen

SEQ ID NO: 29 518847

SEQ ID NO: 30 Chromosome_1:1574300-1583068-510801-Protein kinase, core

SEQ ID NO: 31 Chromosome_7:61977-65285-524187-Mitochondrial substratecarrier

SEQ ID NO: 32 Chromosome_12:7667470-7670704-513869

SEQ ID NO: 33 Chromosome_2:3580095-3582359-518990

SEQ ID NO: 34 Chromosome_3:6643095-6648853-521592 and 521593

SEQ ID NO: 35 521593

SEQ ID NO: 36 Chromosome_9:1237497-1242515-525958-GTP cyclohydrolase II

SEQ ID NO: 37 Chromosome_3:5568689-5570263-521411

SEQ ID NO: 38 Chromosome_16:282424-288402-516007 and 516008-PumilioRNA-binding region

SEQ ID NO: 39 516008

SEQ ID NO: 40 Chromosome_4:1572817-1574661-522081

SEQ ID NO: 41 Chromosome_6:1447962-1450436-523024

SEQ ID NO: 42 Chromosome_16:1504475-1506533-516221-Cytochrome b5

SEQ ID NO: 43 Chromosome_1:2870207-2876411-510991 and 510992-Zincfinger, RING-type

SEQ ID NO: 44 510992

SEQ ID NO: 45 Chromosome_11:96916-101650-512152

SEQ ID NO: 46 Chromosome_10:2157223-2160721-509900

SEQ ID NO: 47 Chromosome_9:2397180-2401234-526112

SEQ ID NO: 48 Chromosome_12:5837-34345-512487-Dysferlin, N-terminal

SEQ ID NO: 49 Chromosome_5:3158380-3168988-522712

SEQ ID NO: 50 Chromosome_3:7372589-7375277-521690 and 521691

SEQ ID NO: 51 521691

SEQ ID NO: 52 Scaffold_18:812020-814034-518128 and 518129

SEQ ID NO: 53 518129

SEQ ID NO: 54 Chromosome_16:1333947-1337621-516191-NUDIX hydrolase, core

SEQ ID NO: 55 Chromosome_6:7046193-7049187-524030

SEQ ID NO: 56 Chromosome_5:2598236-2603160-522637

SEQ ID NO: 57 Chromosome_12:5998066-6000934-513600-ARF/SAR superfamily

SEQ ID NO: 58 Chromosome_12:6007846-6015554-513603 and 513602-ATPase,P-type, K/Mg/Cd/Cu/Zn/Na/Ca/Na/H-transporter

SEQ ID NO: 59 513602

SEQ ID NO: 60 Chromosome_5:599204-604348-522399-Protein kinase, core

SEQ ID NO: 61 Chromosome_11:1798315-1801943-512361-Plexin-like fold

SEQ ID NO: 62 Chromosome_1:1644920-1653379-510810 and 510811-SuppressorMra1 and Protein kinase, core

SEQ ID NO: 63 510811

SEQ ID NO: 64 Chromosome_11:1635862-1641291-512347

SEQ ID NO: 65 Chromosome_3:1573144-1573807-520646-Prolyl 4-hydroxylase,alpha subunit

SEQ ID NO: 66 Chromosome_16:1279888-1286957-516181

SEQ ID NO: 67 Chromosome_16:4386264-4397213-516652-Protease inhibitorI4, serpin

SEQ ID NO: 68 Chromosome_1:3599000-3604849-511112-Nonaspanin (TM9SF)

SEQ ID NO: 69 Chromosome_13:2255306-2255882-514509

SEQ ID NO: 70 Chromosome_10:1448801-1450093-509796-Protein kinase, core

SEQ ID NO: 71 Chromosome_3:4748750-4749429-521248-ATPase, F1 complex,delta/epsilon subunit

SEQ ID NO: 72 Chromosome_5:3179410-3179973-522714

SEQ ID NO: 73 Chromosome_3:4767136-4768060-521254-DNA/RNA helicase,C-terminal

SEQ ID NO: 74 Chromosome_1:3081958-3091707-511033

SEQ ID NO: 75 Chromosome_3:1280615-1281216-520594-Pyrrolo-quinolinequinine

SEQ ID NO: 76 Chromosome_10:6550769-6551325-510601

SEQ ID NO: 77 Chromosome_1:9020623-9021459-511987

SEQ ID NO: 78 Chromosome_6:1607759-1608312-523055-Adenylyl cyclaseclass-3/4/guanylyl cyclase

SEQ ID NO: 79 Chromosome_13:3805266-3813325-514736

SEQ ID NO: 80 Chromosome_6:1189613-1199472-522979-14-3-3 protein

SEQ ID NO: 81 Scaffold_22:71878-72579-520045-AAA+ ATPase, core

SEQ ID NO: 82 Chromosome_10:5505341-5513220-510431-ATPase, P-type,K/Mg/Cd/Cu/Zn/Na/Ca/Na/H-transporter

SEQ ID NO: 83 Chromosome_6:6387040-638749-523923

SEQ ID NO: 84 S7 transcript with UTRs.

SEQ ID NO: 85 S7 transcript without UTRs.

SEQ ID NO: 86 S7 protein sequence.

SEQ ID NO: 87 shows a 336 bp DNA fragment including the cre-MIR 1157stem-loop from C. reinhardtii CC-1690 (mt+).

SEQ ID NO: 88 shows a PCR primer. See Table 3.

SEQ ID NO: 89 shows a PCR primer. See Table 3.

SEQ ID NO: 90 shows a PCR primer. See Table 3.

SEQ ID NO: 91 shows a PCR primer. See Table 3.

SEQ ID NO: 92 shows a PCR primer. See Table 3.

SEQ ID NO: 93 shows a PCR primer. See Table 3.

SEQ ID NO: 94 shows a PCR primer. See Table 3.

SEQ ID NO: 95 shows a PCR primer. See Table 3.

SEQ ID NO: 96 shows a PCR primer. See Table 3.

SEQ ID NO: 97 shows a PCR primer. See Table 3.

SEQ ID NO: 98 shows a PCR primer. See Table 3.

SEQ ID NO: 99 shows a PCR primer. See Table 3.

SEQ ID NO: 100 shows a PCR primer. See Table 3.

SEQ ID NO: 101 shows a PCR primer. See Table 3.

SEQ ID NO: 102 shows a PCR primer. See Table 3.

SEQ ID NO: 103 shows a PCR primer. See Table 3.

SEQ ID NO: 104 shows a PCR primer. See Table 3.

SEQ ID NO: 105 shows a PCR primer. See Table 3.

SEQ ID NO: 106 shows a PCR primer. See Table 3.

SEQ ID NO: 107 shows a PCR primer. See Table 3.

SEQ ID NO: 108 shows a PCR primer. See Table 3.

SEQ ID NO: 109 shows a PCR primer. See Table 3.

SEQ ID NO: 110 shows a PCR primer. See Table 3.

SEQ ID NO: 111 shows a PCR primer. See Table 3.

SEQ ID NO: 112 shows a PCR primer. See Table 3.

SEQ ID NO: 113 shows a PCR primer. See Table 3.

SEQ ID NO: 114 shows a PCR primer. See Table 3.

SEQ ID NO: 115 S7-Augustus v.5 ID: 523016

SEQ ID NO: 116 S16-Protein ID: 195781, Augustus v.5 ID: 512725

SEQ ID NO: 117 Protein ID: 103782, Augustus v.5 ID: 519629

SEQ ID NO: 118 Protein ID: 206633, Augustus v.5 ID: 520112

SEQ ID NO: 119 Protein ID: 174337, Augustus v.5 ID: 519707

SEQ ID NO: 120 Protein ID: 146649, Augustus v.5 ID: 511373

SEQ ID NO: 121 Augustus v.5 ID: 511374

SEQ ID NO: 122 S65-Protein ID: 410325, Augustus v.5 ID: 517886

SEQ ID NO: 123 Augustus v.5 ID: 517887

SEQ ID NO: 124 Protein ID: 144919, Augustus v.5 ID: 524003

SEQ ID NO: 125 Protein ID: 340425, Augustus v.5 ID: 515336

SEQ ID NO: 126 Protein ID: 381008, Augustus v.5 ID: 515337

SEQ ID NO: 127 S74-Augustus v.5 ID: 520845

SEQ ID NO: 128 Protein ID: 343410, Augustus v.5 ID: 520844

SEQ ID NO: 129 S77-Protein ID: 136188, Augustus v.5 ID: 522165

SEQ ID NO: 130 Augustus v.5 ID: 510079

SEQ ID NO: 131 Protein ID: 393353, Augustus v.5 ID: 525104

SEQ ID NO: 132 Protein ID: 176993, Augustus v.5 ID: 516251

SEQ ID NO: 133 S105-Protein ID: 24421, Augustus v.5 ID: 524679

SEQ ID NO: 134 S123-Protein ID: 151044, Augustus v.5 ID: 519822

SEQ ID NO: 135 S129-Protein ID: 182890, Augustus v.5 ID: 510051

SEQ ID NO: 136 Augustus v.5 ID: 510417

SEQ ID NO: 137 Protein ID: 175772

SEQ ID NO: 138 Protein ID: 186281, Augustus v.5 ID: 509766

SEQ ID NO: 139 Augustus v.5 ID: 509765

SEQ ID NO: 140 Augustus v.5 ID: 520043

SEQ ID NO: 141 Protein ID: 205891, Augustus v.5 ID: 526026

SEQ ID NO: 142 Protein ID: 141568, Augustus v.5 ID: 518848

SEQ ID NO: 143 Protein ID: 182094, Augustus v.5 ID: 518847

SEQ ID NO: 144 Protein ID: 406326, Augustus v.5 ID: 510801

SEQ ID NO: 145 Protein ID: 142644, Augustus v.5 ID: 524187

SEQ ID NO: 146 Protein ID: 173319, Augustus v.5 ID: 513869

SEQ ID NO: 147 Augustus v.5 ID: 518990

SEQ ID NO: 148 Augustus v.5 ID: 521592

SEQ ID NO: 149 Protein ID: 293419, Augustus v.5 ID: 521593

SEQ ID NO: 150 Protein ID: 163238, Augustus v.5 ID: 525958

SEQ ID NO: 151 Protein ID: 137074, Augustus v.5 ID: 521411

SEQ ID NO: 152 Protein ID: 396325, Augustus v.5 ID: 516007

SEQ ID NO: 153 Protein ID: 35759, Augustus v.5 ID: 516008

SEQ ID NO: 154 Protein ID: 183391, Augustus v.5 ID: 522081

SEQ ID NO: 155 Protein ID: 182345, Augustus v.5 ID: 523024

SEQ ID NO: 156 Protein ID: 131692, Augustus v.5 ID: 516221

SEQ ID NO: 157 Protein ID: 146316, Augustus v.5 ID: 510991

SEQ ID NO: 158 Protein ID: 404865, Augustus v.5 ID: 510992

SEQ ID NO: 159 Protein ID: 394374, Augustus v.5 ID: 512152

SEQ ID NO: 160 Protein ID: 421582, Augustus v.5 ID: 509900

SEQ ID NO: 161 Protein ID: 184523, Augustus v.5 ID: 526112

SEQ ID NO: 162 S276-Augustus v.5 ID: 512487

SEQ ID NO: 163 Augustus v.5 ID: 522712

SEQ ID NO: 164 Augustus v.5 ID: 521690

SEQ ID NO: 165 Augustus v.5 ID: 521691

SEQ ID NO: 166 S289-Protein ID: 411672, Augustus v.5 ID: 518128

SEQ ID NO: 167 Protein ID: 185219, Augustus v.5 ID: 518129

SEQ ID NO: 168 S291-Protein ID: 395916, Augustus v.5 ID: 516191

SEQ ID NO: 169 S292-Protein ID: 188858, Augustus v.5 ID: 524030

SEQ ID NO: 170 S294-Protein ID: 187373, Augustus v.5 ID: 522637

SEQ ID NO: 171 S303-Protein ID: 190294, Augustus v.5 ID: 513600

SEQ ID NO: 172 Augustus v.5 ID: 513603

SEQ ID NO: 173 Protein ID: 190292, Augustus v.5 ID: 513602

SEQ ID NO: 174 Protein ID: 132979, Augustus v.5 ID: 522399

SEQ ID NO: 175 S338-Protein ID: 151163, Augustus v.5 ID: 512361

SEQ ID NO: 176 Protein ID: 178371, Augustus v.5 ID: 510810

SEQ ID NO: 177 Protein ID: 193901, Augustus v.5 ID: 510811

SEQ ID NO: 178 Protein ID: 151147, Augustus v.5 ID: 512347

SEQ ID NO: 179 Protein ID: 417522, Augustus v.5 ID: 520646

SEQ ID NO: 180 Protein ID: 195665, Augustus v.5 ID: 516181

SEQ ID NO: 181 Protein ID: 288478, Augustus v.5 ID: 516652

SEQ ID NO: 182 Protein ID: 136718, Augustus v.5 ID: 511112

SEQ ID NO: 183 Augustus v.5 ID: 514509

SEQ ID NO: 184 Protein ID: 206095, Augustus v.5 ID: 509796

SEQ ID NO: 185 Protein ID: 136002, Augustus v.5 ID: 521248

SEQ ID NO: 186 Protein ID: 294540, Augustus v.5 ID: 522714

SEQ ID NO: 187 Protein ID: 136100, Augustus v.5 ID: 521254

SEQ ID NO: 188 Protein ID: 342157, Augustus v.5 ID: 511033

SEQ ID NO: 189 Protein ID: 206488, Augustus v.5 ID: 520594

SEQ ID NO: 190 Protein ID: 205974, Augustus v.5 ID: 510601

SEQ ID NO: 191 Protein ID: 130473, Augustus v.5 ID: 511987

SEQ ID NO: 192 Augustus v.5 ID: 523055

SEQ ID NO: 193 Protein ID: 331285, Augustus v.5 ID: 514736

SEQ ID NO: 194 Protein ID: 187228, Augustus v.5 ID: 522979

SEQ ID NO: 195 Protein ID: 132213, Augustus v.5 ID: 520045

SEQ ID NO: 196 Protein ID: 182602, Augustus v.5 ID: 510431

SEQ ID NO: 197 Augustus v.5 ID: 523923

SEQ ID NO: 198 S1612-Protein ID: 103075, Augustus v.5 ID: 513845

SEQ ID NO: 199 S1625-Protein ID: 186846

SEQ ID NO: 200 S1644-Protein ID: 331285, Augustus v.5 ID: 514736

SEQ ID NO: 201 S1659-Protein ID: 178706

SEQ ID NO: 202 S1666-Augustus v.5 ID: 514721

SEQ ID NO: 203 S1687-Protein ID: 291633

SEQ ID NO: 204 S1693-Protein ID: 188114

SEQ ID NO: 205 S1702-Protein ID: 536097

SEQ ID NO: 206 S1704-Protein ID: 77062

SEQ ID NO: 2117 S7 amiRNA cloning fragment

SEQ ID NO: 218 S16 amiRNA cloning fragment

SEQ ID NO: 219 S65 amiRNA cloning fragment

SEQ ID NO: 210 S77 amiRNA cloning fragment

SEQ ID NO: 211 S105 amiRNA cloning fragment

SEQ ID NO: 212 S123 amiRNA cloning fragment

SEQ ID NO: 213 S129 amiRNA cloning fragment

SEQ ID NO: 214 S276 amiRNA cloning fragment

SEQ ID NO: 215 S289 amiRNA cloning fragment

SEQ ID NO: 216 S291 amiRNA cloning fragment

SEQ ID NO: 217 S292 amiRNA cloning fragment

SEQ ID NO: 218 S294 amiRNA cloning fragment

SEQ ID NO: 219 S303 amiRNA cloning fragment

SEQ ID NO: 220 S338 amiRNA cloning fragment

SEQ ID NO: 221 S1612 amiRNA cloning fragment

SEQ ID NO: 222 S1644 amiRNA cloning fragment

SEQ ID NO: 223 S1659 amiRNA cloning fragment

SEQ ID NO: 224 S1666 amiRNA cloning fragment

SEQ ID NO: 225 S1687 amiRNA cloning fragment

SEQ ID NO: 226 S1693 amiRNA cloning fragment

SEQ ID NO: 227 S1704 amiRNA cloning fragment

SEQ ID NO: 228 BD11 sequence

SEQ ID NO: 229 BD11 3′ primer to generate double stranded amiRNA cloningfragment.

SEQ ID NO: 230 S1613-Protein ID: 174261, Augustus v.5 ID: 519617

SEQ ID NO: 231 S1621-Protein ID: 206559

SEQ ID NO: 232 S1623-Protein ID: 116145, Augustus v.5 ID: 511331

SEQ ID NO: 233 S1638-Protein ID: 418706, Augustus v.5 ID: 521355

SEQ ID NO: 234 S1655-Augustus v.5 ID: 525078

SEQ ID NO: 235 S74 amiRNA cloning fragment

SEQ ID NO: 236 S1613 amiRNA cloning fragment

SEQ ID NO: 237 S1621 amiRNA cloning fragment

SEQ ID NO: 238 S1623 amiRNA cloning fragment

SEQ ID NO: 239 S1638 amiRNA cloning fragment

SEQ ID NO: 240 S1655 amiRNA cloning fragment

SEQ ID NO: 241 shows a PCR primer. See Table 3.

SEQ ID NO: 242 shows a PCR primer. See Table 3.

SEQ ID NO: 243 shows a PCR primer. See Table 3.

SEQ ID NO: 244 shows a PCR primer. See Table 3.

SEQ ID NO: 245 shows a PCR primer. See Table 3.

SEQ ID NO: 246 shows a PCR primer. See Table 3.

TABLE 1 Sequence Listing Protein ID Strain Plate Number Number NumberID# SEQ ID NO: 115 523016 (aug5) S7 S7 SEQ ID NO: 116 195781 S16 S16 SEQID NO: 122 517886 (aug5) S65 S65 SEQ ID NO: 201 178706 S1659 S59 SEQ IDNO: 129 522165 (aug5) S77 S77 SEQ ID NO: 202 514721 (aug5) S1666 S66 SEQID NO: 206 77062 S1704 S109 SEQ ID NO: 133 524679 (aug5) S105 S105 SEQID NO: 198 103075 S1612 S12 SEQ ID NO: 200 331285 S1644 S44 SEQ ID NO:204 188114 S1693 S120 SEQ ID NO: 203 291633 S1687 S114 SEQ ID NO: 135510051 (aug5) S129 S129 SEQ ID NO: 134 519822 (aug5) S123 S123 SEQ IDNO: 166 518128 (aug5) S289 S289 SEQ ID NO: 162 512487(aug5) S276 S276SEQ ID NO: 169 524030 (aug5) S292 S292 SEQ ID NO: 168 516191 (aug5) S291S291 SEQ ID NO: 170 522637 (aug5) S294 S294 SEQ ID NO: 175 512361 (aug5)S338 S338 SEQ ID NO: 127 520845 (aug5) S74 S74 SEQ ID NO: 230 174261S1613 S1613 SEQ ID NO: 231 206559 S1621 S1621 SEQ ID NO: 232 116145S1623 S1623 SEQ ID NO: 233 418706 S1638 S1638 SEQ ID NO: 234 525078(aug5) S1655 S1655 *aug5 refers to the Augustus v.5 Protein ID database.These are used because the standard annotation of the C. reinhardtiigenome does not include those genes. Augustus v.5 is generated by a geneprediction algorithm.

RNA Silencing

Chlamydomonas reinhardtii is a single-celled green alga that is an idealmodel system for studying several biological processes. Its recentlysequenced genome has advanced our understanding of the ancestraleukaryotic cell and revealed many previously unknown genes that may beassociated with photosynthetic and flagellar functions (for example, asdescribed in Merchant. S. S., et al. (2007) Science, 318, 245-250).Analysis of this genome requires a convenient system for reverse geneticanalysis.

Transposon tagging, insertional nmutagenesis and tilling have beenhighly successful reverse genetics tools in flowering plants (forexample, as described in Alonso, J. M. and Ecker, J. R. (2006) Natl.Rev. Genet., 7, 524-536), but have not yet been fully developed inChlamydomonas. Saturating entire genomes by these approaches requiresvery large mutant populations and can be limited by the selectivity ofmutational targeting. Alternative methods for high-throughput analysisof gene function are based on RNA silencing. They exploit a conservedcellular mechanism that probably evolved as a defense strategy againstviruses and transposons and that has been adopted for endogenous generegulation in many eukaryotes (for example, as described in Baulcombe,D. (2006) Short Silencing RNA: The Dark Matter of Geneics? Cold SpringHarb. Symp. Quant. Biol., LXXI, 13-20). Small RNAs (21-24 nucleotides(nt)) are central components in this process, providing sequencespecificity for the effector complexes of the silencing machinery.

There are two main classes of small RNAs in RNA silencing: smallinterfering RNAs (siRNAs) and microRNAs (miRNAs). The siRNAs areproduced from a perfectly double-stranded (ds) RNA by RNaseIII-likeenzymes (Dicer or Dicer-like), releasing several double-strandedintermediates of about 21 nt in length, with a two-nucleotide 3′overhang(for example, as described in Elbashir, S. M., et al. (2001) Genes Dev.,15, 188-200). In contrast, miRNA intermediates are released by Dicer asa 21-24-nt RNA duplex from a partly double-stranded region of animperfectly matched foldback RNA (for example, as described in Ambros,V. (2001) Cell, 107, 823-826). Each miRNA precursor typically gives riseto one predominant 21-24-nt RNA duplex whereas multiple forms of thismolecule are generated from siRNA precursors.

The short dsRNAs are processed similarly in both miRNA and siRNApathways. The strands with lower thermodynamic stability at their 5′ends are stably retained by an Argonaute (AGO) protein (for example, asdescribed in Khvorova, A., et al. (2003) Cell, 115, 209-216; andSchwarz, D. S., et al. (2003) Cell, 115, 199-208) through a mechanismthat is influenced by the 5′ nucleotide (for example, as described inMi, S., et al. (2008) Cell, 133, 116-127). The resulting AGOribonucleoprotein is the effector of silencing that is guided to itstarget nucleic acids through Watson-Crick base pairing with the boundsmall RNA. The small RNA strand that is not incorporated into theArgonaute is referred to as the passenger strand or miRNA* and israpidly degraded.

The targeting mechanisms involve transcriptional or posttranscriptionalregulation of the target sequence. The transcriptional silencingmechanism is not well understood and it has not been used in methods forfunctional analysis of genome sequences. The post-transcriptionalmechanisms, in contrast, are better understood in detail and have beenused widely. They involve translational arrest or targeted RNAdegradation, either by mRNA destabilization or miRNA guided cleavage(for example, as described in Bartel, D. P. (2004) Cell, 116, 281-297);small RNAs displaying partial complementarity to the target RNAtypically cause translational inhibition whereas those with a completeor near-complete match are more likely to direct mRNA cleavage. ThemiRNAs in animals are often complementary to their target in a shortseed region (positions 2 to 8) allowing each miRNA to target many, oftenhundreds, of mRNAs (for example, as described in Brennecke, J., et al.(2005) PloS Biology, 3, e85; Farh, K. K., et al. (2005) Science, 310,1817-1821; Lewis, B. P., et al. (2005) Cell, 120, 15-20; and Lim, L. P.,et al. (2005) Nature, 433, 769-773). In contrast, plant miRNAs have few(zero to five) mismatches to their targets and normally triggertranscript cleavage and subsequent degradation of a limited number ofmRNAs (for example, as described in Llave, C., et al. (2002) Science,297, 2053-2056; and Schwab, R., et al. (2005) Developmental Cell, 8,517-527).

An alternative to the use of long dsRNA transgenes to down-regulate agene of interest involves modified versions of endogenous miRNA (forexample, as described in Zeng, Y., et al. (2002) Molecular Cell, 9,1327-1333; Parizotto, E. A., et al. (2004) Genes Dev., 18, 1-6; Alvarez,J. P., et al. (2006) The Plant Cell, 18, 1134-1151; Niu, Q. W., et al.(2006) Natl. Biotechnol., 24, 1420-1428; Schwab, R., et al. (2006) ThePlant Cell, 18, 1121-1133; and Warthmann, N., et al. (2008) PloS ONE, 3,e1829). This artificial miRNA approach overcomes the self-silencingproblems of siRNAs because miRNAs are not normally associated withtranscriptional silencing. In addition, each artificial miRNA precursorgives rise to only a single small RNA species that can be optimized toavoid off-target effects, at least in the case of organisms withcomplete genome information.

Chlamydomonas miRNA loci can be subdivided into two categories. Those inthe ‘short hairpin’ category resemble typical miRNA loci of land plantsand animals in that the hairpin regions are shorter than 150 nt and theyspecify a single miRNA. The predicted transcripts of ‘long hairpin’ lociin Chlamydomonas can form long (150-729 nt) almost perfect hairpins,with the potential to produce multiple small RNAs (for example, asdescribed in Molnar, A., et al. (2007) Nature, 447, 1126-1129; and Zhao,T., et al. (2007) Genes Dev., 21, 1190-1203).

Artificial miRNAs (amiRNAs) can be used as a highly specific,high-throughput silencing system to verify a desired phenotype (forexample, a salt, herbicide, or bleach resistance organism) that is theresult of the expression of a candidate gene.

The present disclosure recognizes that large scale cultures of algae canbe used to produce a variety of biomolecules. The disclosed methods,constructs, algae, and cells are provided to fully realize theadvantages of algal cultures for large-scale production of usefulbiomolecules as well as for other purposes, such as, for example, carbonfixation or decontamination of compounds, solutions, or mixtures. Thepresent disclosure also recognizes the potential for algae, throughphotosynthetic carbon fixation, to convert CO₂ to sugar, starch, lipids,fats, or other biomolecules, thereby removing a greenhouse gas from theatmosphere while providing therapeutic or industrial products, a fuelproduct, or nutrients for human or animal consumption. To enable largescale growth of algal cultures in open ponds or large containers inwhich they efficiently and economically have access to CO₂ and light, itis important to deter the growth of competing organisms that mightotherwise contaminate and even overtake the culture. Provided herein arealgae in which genes have been knocked out or knocked down to confersalt tolerance, such that the algae are able to grow in the presence ofsalt at a concentration that deters growth of algae not harboring theknock out or knock down gene. The concentration of salt may also deterthe growth of other organisms, such as, but not necessarily limited to,other algal species.

Plant species, which includes algae, vary in how well they toleratesalt. Some plants will tolerate high levels of salinity while others cantolerate little or no salinity. The relative growth of plants in thepresence of salinity is termed their salt tolerance. Salt tolerance isthe ability of a modified plant or plant cell (also host cell ororganism) to display an improved response to an increase inextracellular and/or intracellular concentration of salt including, butnot limited to, Na+, Li+ and K+, as compared to an unmodified plant orplant cell. Increased salt tolerance may be manifested by phenotypiccharacteristics including longer life span, apparent normal growth andfunction of the plant, and/or a decreased level of necrosis, whensubjected to an increase in salt concentration, as compared to aunmodified plant. Salt tolerance is measured by methods known in the artsuch as those described in Inan et al. (July 2004) Plant Physiol.135:1718, including without limitation, NaCl shock exposure or gradualincrease NaCl concentration.

A transgenic algal cell of the present disclosure has increased salttolerance with respect to a wild type algal cell that does not containthe knock out or knock down gene. In some embodiments, the salttolerance is at least twice that of a wildtype alga. The salt tolerancecan be at least 1.5, 2, 2.5, 3, 3.5, 4, 5 or more than 5 fold higherthan that of a wildtype alga.

The salt used in the present disclosure can be a sodium (Na+) salt, alithium (Li+) salt, or a potassium (K+) salt. The concentration of Na+in the selection media for the transgenic algae of the presentdisclosure can be at least 200 mM. The concentration of Li+ in theselection media for the transgenic algae of the present disclosure canbe at least 2 mM,

Algae

The present disclosure provides algae and algal cells in which one ormore polynucleotides have been knocked out to confer salt tolerance.Also provided are algae and algal cells transformed with apolynucleotide encoding the Bt toxin that is lethal to some insect androtifer species. The transformed algae may be referred to herein as“host algae”.

Algae in which genes have been knocked out to provide salt tolerance asdisclosed herein can be macroalgae or microalgae. Microalgae includeeukaryotic microalgae and cyanobacteria.

An exemplary group of organisms for use in the present disclosure arespecies of the green algae (Chlorophyta). These algae are found in soil,fresh water, oceans, and even in snow on mountaintops. Algae in thisgenus have a cell wall, a chloroplast, and two anterior flagellaallowing mobility in liquid environments. More than 500 differentspecies of Chlamydomonas have been described.

The most widely used laboratory species is C. reinhardtii. When deprivedof nitrogen, C. reinhardtii cells can differentiate into isogametes. Twodistinct mating types, designated mt+ and mt−, exist. These fusesexually, thereby generating a thick-walled zygote which forms a hardouter wall that protects it from various environmental conditions. Whenrestored to nitrogen culture medium in the presence of light and water,the diploid zygospore undergoes meiosis and releases four haploid cellsthat resume the vegetative life cycle. In mitotic growth the cellsdouble as fast as every eight hours.

The nuclear genetics of C. reinhardtii is well established. There are alarge number of mutants that have been characterized and the C.reinhardtii center (www.chlamy.org) maintains an extensive collection ofmutants, as well as annotated genomic sequences of Chlamydomonasspecies. A large number of chloroplast mutants as well as severalmitochondrial mutants have been developed in C. reinhardtii.

While the methods and transformed cells are described herein with C.reinhardtii in some exemplary aspects, it is understood that the methodsand transformants described herein are also applicable to other algae,including cyanobacteria such as but not limited to Synechococcus,Synechocystis, Athrospira, Anacytis, Anabaena, Nostoc, Spirulina, andFremyella species and including green microalgae such as but not limitedto Dunaliella, Scenedesmus, Chlorella, Volvox, or Hematococcus species.

Transformed cells are produced by introducing DNA into a population oftarget cells and selecting the cells which have taken up the DNA. Insome embodiments, knock outs or knock downs that confer resistance tosalt may be grown in the presence of high salt concentrations to selectfor successful knock outs or knock downs. The knock out or knock downsequence can be introduced into an algal cell using a direct genetransfer method such as, for example, electroporation, microprojectilemediated (biolistic) transformation using a particle gun, the “glassbead method” or by cationic lipid or liposome-mediated transformation.

Nuclear transformation of eukaryotic algal cells can be bymicroprojectile mediated transformation, or can be by protoplasttransformation, electroporation, introduction of DNA using glass fibers,or the glass bead agitation method, as nonlimiting examples (Kindle.Proc. Natl. Acad. Sciences USA 87: 1228-1232 (1990); Shimogawara et al.Genetics 148: 1821-1828 (1998)). Markers for nuclear transformation ofalgae include, without limitation, markers for rescuing auxotrophicstrains (e.g., NIT1 and ARG7 in Chlamydomonas; Kindle et al. J. CellBiol. 109: 2589-2601 (1989), Debuchy et al. EMBO J. 8: 2803-2809(1989)), as well as dominant selectable markers (e.g., CRY1, aada;Nelson et al. Mol. Cellular Biol. 14: 4011-4019 (1994), Cerutti et al.Genetics 145: 97-110 (1997)). In some embodiments, the presence of theknock out or knock down is used as a selectable marker fortransformants. A knock out sequence can in some embodiments beco-transformed with a second sequence encoding a protein to be producedby the alga (for example, a therapeutic protein, industrial enzyme) or aprotein that promotes or enhances production of a commercial,therapeutic, or nutritional product. The second sequence is in someembodiments provided on the same nucleic acid construct as the knock outsequence for transformation into the alga, in which the success of theknock out sequence in activating the gene of interest is used as theselectable marker.

Several cell division cycles following transformation are generallyrequired to reach a homoplastidic state. Algae may be allowed to dividein the presence or absence of a selection agent, or under stepped-upselection (use of a lower concentration of the selective agent thanhomoplastic cells would be expected to grow on, which can be increasedover time) prior to screening transformants. Screening of transformantsby PCR or Southern hybridization, for example, can be performed todetermine whether a transformant is homoplastic or heteroplastic, and ifheteroplastic, the degree to which the recombinant gene has integratedinto copies of the chloroplast genome.

For transformation of chloroplasts, a major benefit can be theutilization of a recombinant nucleic acid construct which contains boththe knock out sequence and one or more genes of interest. Typically,transformation of chloroplasts is performed by co-transformation ofchloroplasts with two constructs: one containing knock out sequence anda second containing the gene(s) of interest. Transformants are screenedfor presence of the knock out or knock down (salt tolerance) and, insome embodiments, for the presence of (a) further gene(s) of interest.Typically, secondary screening for one or more gene(s) of interest isperformed by PCR or Southern blot (see, for example PCT/US2007/072465).

The organisms/host cells herein can be transformed to modify theproduction of a product(s) with a vector, in this case to decrease oreliminate production of a product(s). The vector is typicallysubstantially homologous to the gene to be knocked out to allow forhomologous recombination to take place, but has been modified in such away that the product normally produced by the gene is not produced, isproduced in an inactive form, or is produced in a form in which thenormal activity of the product is greatly reduced.

One approach to construction of a genetically manipulated strain of algainvolves transformation with a nucleic acid which inactivates a gene ofinterest to, for example, confer resistance to salt. In someembodiments, a transformation may introduce nucleic acids into the hostalga cell (for example, a chloroplast or nucleus of a eukaryotic hostcell). Transformed cells are typically plated on selective mediafollowing introduction of exogenous nucleic acids. This method may alsocomprise several steps for screening. Initially, a screen of primarytransformants is typically conducted to determine which clones haveproper insertion of the exogenous nucleic acids. Clones which show theproper integration may be replica plated and re-screened to ensuregenetic stability. Such methodology ensures that the genes of interesthave been knocked out or knocked down. In many instances, such screeningis performed by polymerase chain reaction (PCR); however, any otherappropriate technique known in the art may be utilized, Many differentmethods of PCR are known in the art (for example, nested PCR, real timePCR).

The entire chloroplast genome of C. reinhardtii is available as GenBankAcc. No. BK000554 and reviewed in J. Maul, et al. The Plant Cell 14:2659-2679 (2002), both incorporated by reference herein. TheChlamydomonas genome is also provided to the public on the world wideweb, at the URL “biology.duke.edu/chlamy_genome/-chloro.html” (see “viewcomplete genome as text file” link and “maps of the chloroplast genome”link), each of which is incorporated herein by reference. To create aknock out, the nucleotide sequence of the chloroplast genomic DNA isselected such that it is a portion of a gene of interest, including aregulatory sequence or coding sequence. In this respect, the websitecontaining the C. reinhardtii chloroplast genome sequence also providesmaps showing coding and non-coding regions of the chloroplast genome,thus facilitating selection of a sequence useful for constructing aknock out vector.

A knock out nucleic acid molecule may include a nucleotide sequenceencoding a reporter polypeptide or other selectable marker. The term“reporter” or “selectable marker” refers to a polynucleotide (or encodedpolypeptide) that confers a detectable phenotype. A reporter generallyencodes a detectable polypeptide, for example, a green fluorescentprotein or an enzyme such as luciferase, which, when contacted with anappropriate agent (a particular wavelength of light or luciferin,respectively) generates a signal that can be detected by eye or usingappropriate instrumentation (Giacomin, Plant Sci. 116:59-72, 1996;Scikantha, J. Bacteriol. 178:121, 1996; Gerdes, FEBS Lett. 389:44-47,1996; see, also, Jefferson, EMBO J. 6:3901-3907, 1997,fl-glucuronidase).

A selectable marker can provide a means to rapidly screen prokaryoticcells or plant cells or both that have incorporated the knock outsequence and so express the marker. Examples of selectable markersinclude, but are not limited to, those that confer antimetaboliteresistance, for example, dihydrofolate reductase, which confersresistance to methotrexate (Reiss, Plant Physiol. (Life Sci. Adv.)13:143-149, 1994); neomycin phosphotransferase, which confers resistanceto the aminoglycosides neomycin, kanamycin and paromycin(Herrera-Estrella, EMBO J. 2:987-995, 1983), hygro, which confersresistance to hygromycin (Marsh, Gene 32:481-485, 1984), trpB, whichallows cells to utilize indole in place of tryptophan; hisD, whichallows cells to utilize histinol in place of histidine (Hartman, Proc.Natl. Acad. Sci., USA 85:8047, 1988); mannose-6-phosphate isomerasewhich allows cells to utilize mannose (WO 94/20627); ornithinedecarboxylase, which confers resistance to the ornithine decarboxylaseinhibitor, 2-(difluoromethyl)-DL-ornithine (DFMO; McConlogue, 1987, in:Current Communications in Molecular Biology, Cold Spring HarborLaboratory ed.); and deaminase from Aspergillus terreus, which confersresistance to Blasticidin S (Tamura, Biosci. Biotechnol Biochem.59:2336-2338, 1995). Selectable markers include polynucleotides thatconfer dihydrofolate reductase (DHFR) or neomycin resistance foreukaryotic cells and tetracycline; ampicillin resistance for prokaryotessuch as E. coli; and bleomycin, gentamycin, glyphosate, hygromycin,kanamycin, methotrexate, phleomycin, phosphinotricin, spectinomycin,streptomycin, sulfonamide and sulfonylurea resistance in plants (see,for example, Maliga et al., Methods in Plant Molecular Biology, ColdSpring Harbor Laboratory Press, 1995, page 39).

Salt tolerance can also be a selectable marker. The host algae disclosedherein that are transformed with polynucleotides knocking out orknocking down one or more genes in order to confer resistance to saltmay be selected for with elevated salt concentrations. Alternatively, aselectable marker such as kanamycin or bleomycin or nitrate reductasemay be co-transformed with the knock out sequence, and transformed cellscan initially be selected for using a selection media or compound thatis not related to the knocked out gene.

Large scale cultures of algae bioengineered for salt tolerance can beused for the production of biomolecules, which can be therapeutic,nutritional, commercial, or fuel products, or for fixation of CO₂, orfor decontamination of compounds, mixtures, samples, or solutions. Thesalt tolerant algae provided herein can be grown in a concentration ofsalt that can impede or prevent the growth of species other than thealgal species used for bioproduction, decontamination, or CO₂ fixation.In certain embodiments the concentration of salt is 1.5, 2, 2.5, 3, 3.5,4, 4.5, 5 or greater than 5 times tolerated by the corresponding wildtype alga. In other embodiments the average concentration of Na²⁺ is 200mM or more. In still other embodiments the concentration of Li⁺ in themedium is about 2 mM or more. In certain embodiments of the disclosure,a host alga engineered to provide salt tolerance is transformed with oneor more additional genes that encodes an exogenous or endogenous proteinthat is produced by the alga when it is grown in culture, in which theexogenous or endogenous protein is a therapeutic, nutritional,commercial, or fuel product, or increases production or facilitatesisolation of a therapeutic, nutritional, commercial, or fuel product.

A salt resistant alga as provided herein may be used in some embodimentsto produce biomolecules that are endogenous or not endogenous to thealgal host. In some embodiments, the genetically engineered salttolerant algae can be cultured for environmental remediation or CO₂fixation. The algae may additionally be transformed with one or morerecombinant exogenous or endogenous polynucleotides that enable growthof the algae in the presence of at least one herbicide. Geneticengineering of algae to confer resistance to herbicides has beendescribed in U.S. patent application 61/142,091 filed Dec. 31, 2008,which in incorporated reference in its entirety.

In some embodiments, a prokaryotic alga provided herein is resistant toone or more herbicides in addition to being salt tolerant. A prokaryoticalga can include a first recombinant exogenous or endogenous herbicideresistance gene conferring resistance to a first herbicide and a secondexogenous or endogenous herbicide resistance gene conferring resistanceto a second herbicide.

The polynucleotide encoding the herbicide resistance gene can beprovided in a vector for transformation of the algal host. In someembodiments, the vector is designed for integration into the hostgenome, and can include, for example, sequences having homology to thehost genome flanking the herbicide resistance gene to promote homologousrecombination. In other embodiments, the vector can have an origin ofreplication such that it can be maintained in the host as anautonomously replicating episome. In some embodiments, theprotein-encoding sequence of the polynucleotide is codon biased toreflect the codon bias of the host alga.

The disclosure also provides a salt tolerant eukaryotic alga furthercomprising one or more recombinant polynucleotide sequences encodingproteins that confer resistance to herbicides, in which each of theproteins confers resistance to a different herbicide. In someembodiments, an herbicide resistant alga transformed with herbicideresistance genes is resistant to two or more herbicides that inhibitdifferent amino acid biosynthesis pathways, for example, glyphosate andsulfonylureas, or glyphosate and phosphinothricin. In some embodiments,an herbicide resistant alga transformed with herbicide resistance genesis resistant to two or more herbicides, in which at least one herbicideinhibits an amino acid biosynthesis pathway, and at least one herbicidedoes not inhibit an amino acid biosynthesis pathway. For example, anherbicide resistant alga can include recombinant genes conferringglyphosate resistance and resistance to norflurazon.

In some embodiments of an alga comprising two or more recombinantpolynucleotide sequences encoding proteins that confer resistance toherbicides, at least one of the recombinant polynucleotides encodes anendogenous protein conferring herbicide resistance. In some embodiments,at least one of the polynucleotides encodes an exogenous proteinconferring herbicide resistance.

Also disclosed herein are methods of producing one or more biomolecules,in which the methods include engineering an alga by knocking out one ormore genes thereby conferring salt tolerance, growing the alga in thepresence of the elevated salt concentrations, and harvesting one or morebiomolecules from the alga or algal media. The methods in someembodiments include isolating the one or more biomolecules.

The genetically engineered salt tolerant alga is grown in mediacontaining a concentration of a salt that permits growth of thetransformed alga, but inhibits growth of the same species of alga thatis not engineered to confer resistance to the salt. In some embodiments,the concentration of salt in the media in which the geneticallyengineered alga is grown to produce a biomolecule or product inhibitsthe growth of at least one other algal species. In some embodiments, theconcentration of salt in the media in which the genetically engineeredalga is grown to produce a biomolecule or product inhibits the growth ofat least one bacterial species or at least one fungal species. Theconcentration for optimal bioproduction by the host alga and inhibitionof growth of other nontransformed species can be empirically determined.

In some embodiments, genetically engineered salt tolerant algae thatinclude one or more recombinant polynucleotides encoding proteins eachconferring resistance to a different herbicide are grown in mediacontaining one or more herbicides. The one or more herbicides incombination can inhibit the growth of any combination of at least onealgal species, at least one bacterial species, and at least one fungalspecies.

A product (for example fuel product, fragrance product, insecticideproduct, commercial product, therapeutic product) may be produced by analgal culture by a method that comprises the step of growing/culturing asalt tolerant alga in media that includes elevated concentrations of oneor more salts such as NaCl or LiCl or both. The methods herein canfurther comprise the step of collecting a product produced by theorganism. The product can be the product of an exogenous nucleotidetransformed into the alga. In some embodiments, the product (for examplefuel product, fragrance product, insecticide product) is collected byharvesting the organism. The product may then be extracted from theorganism.

In one embodiment, methods are provided for producing abiomass-degrading enzyme in an alga, in which the methods includeengineering the alga to knock out one or more genes thereby conferringsalt tolerance to the alga and transforming the alga with a sequenceencoding an exogenous biomass-degrading enzyme or which promotesincreased expression of an endogenous biomass-degrading enzyme; growingthe alga in the presence of elevated concentrations of one or more saltsand under conditions which allow for production of the biomass-degradingenzyme, in which the salt is in sufficient concentration to inhibitgrowth of the alga which has not been engineered for salt tolerance, toproducing the biomass-degrading enzyme. The methods in some embodimentsinclude isolating the biomass-degrading enzyme.

In some embodiments, the expression of the product (for example fuelproduct, fragrance product, insecticide product) is inducible. Theproduct may be induced to be expressed. Expression may be inducible bylight. In yet other embodiments, the production of the product isautoregulatable. The product may form a feedback loop, wherein when theproduct (for example fuel product, fragrance product, insecticideproduct) reaches a certain level, expression of the product may beinhibited. In other embodiments, the level of a metabolite of theorganism inhibits expression of the product. For example, endogenous ATPproduced by the organism as a result of increased energy production toexpress the product, may form a feedback loop to inhibit expression ofthe product. In yet another embodiment, production of the product may beinducible, for example, by light or an exogenous agent. For example, anexpression vector for effecting production of a product in the hostorganism may comprise an inducible regulatory control sequence that isactivated or inactivated by an exogenous agent.

The methods herein may further comprise the step of providing to theorganism a source of inorganic carbons, such as flue gas. In someinstances, the inorganic carbon source provides all of the carbonsnecessary for making the product (for example, fuel product). Thegrowing/culturing step can occur in a suitable medium, such as one thathas minerals and/or vitamins in addition to elevated concentrations ofone or more salts.

The methods herein comprise selecting genes that are useful to produceproducts, such as fuels, fragrances, therapeutic compounds, andinsecticides, transforming genetically engineered salt tolerant algaewith such gene(s), and growing such algae in the presence of elevatedconcentrations of one or more salts under conditions suitable to allowthe product to be produced. Organisms can be cultured in conventionalfermentation bioreactors, which include, but are not limited to, batch,fed-batch, cell recycle, and continuous fermentors. Further, they may begrown in photobioreactors (see for example US Appl. Publ. No.20050260553; U.S. Pat. Nos. 5,958,761; 6,083,740). Culturing can also beconducted in shake flasks, test tubes, microtiter dishes, and petriplates. Culturing is carried out at a temperature, pH and oxygen contentappropriate for the recombinant cell and at a salt concentration thatpermits growth and bioproduction by the algae.

The genetically engineered, salt tolerant algae and methods providedherein can expand the culturing conditions of the algae to larger areasthat may be open and, in the absence of resistance, subject tocontamination of the culture, for example, on land, such as inlandfills. In some cases, organism(s) are grown near ethanol productionplants or other facilities or regions (for example, cities, highways,etc.) generating CO₂. As such, the methods herein contemplate businessmethods for selling carbon credits to ethanol plants or other facilitiesor regions generating CO₂ while making fuels by growing one or more ofthe modified organisms described herein in the presence of elevatedconcentrations of one or more salts.

Host Cells or Host Organisms

Biomass useful in the methods and systems described herein can beobtained from host cells or host organisms that have been modified (e.g.genetically engineered) to be, for example, salt tolerant, herbicideresistant, or sodium hypochlorite resistant, as compared to anunmodified organism. In addition, the host cells or host organism can befurther modified to express an exogenous or endogenous protein, such asa protein involved in the isoprenoid biosynthetic pathway or a proteininvolved in the accumulation and/or secretion of fatty acids, glycerollipids, or oils.

A host cell can contain a polynucleotide encoding a polypeptide of thepresent disclosure. In some embodiments, a host cell is part of amulticellular organism. In other embodiments, a host cell is cultured asa unicellular organism.

Host organisms can include any suitable host, for example, amicroorganism. Microorganisms which are useful for the methods describedherein include, for example, photosynthetic bacteria (e.g.,cyanobacteria), non-photosynthetic bacteria (e.g., E. coli), yeast(e.g., Saccharomyces cerevisiae), and algae (e.g., microalgae such asChlamydomonas reinhardtii).

Examples of host organisms that can be transformed with a polynucleotideof interest (for example, a polynucleotide that encodes a proteininvolved in the isoprenoid biosynthesis pathway) include vascular andnon-vascular organisms. The organism can be prokaryotic or eukaryotic.The organism can be unicellular or multicellular. A host organism is anorganism comprising a host cell. In other embodiments, the host organismis photosynthetic. A photosynthetic organism is one that naturallyphotosynthesizes (e.g., an alga) or that is genetically engineered orotherwise modified to be photosynthetic. In some instances, aphotosynthetic organism may be transformed with a construct or vector ofthe disclosure which renders all or part of the photosynthetic apparatusinoperable.

By way of example, a non-vascular photosynthetic microalga species (forexample, C. reinhardtii, Nannochloropsis oceania, N. salina, D. salina,H. pluvalis, S. dimorphus, D. viridis, Chlorella sp., and D. teriolecta)can be genetically engineered to produce a polypeptide of interest, forexample a fusicoccadiene synthase or an FPP synthase. Production of afusicoccadiene synthase or an FPP synthase in these microalgae can beachieved by engineering the microalgae to express the fusicoccadienesynthase or FPP synthase in the algal chloroplast or nucleus.

In other embodiments the host organism is a vascular plant. Non-limitingexamples of such plants include various monocots and dicots, includinghigh oil seed plants such as high oil seed Brassica (e.g., Brassicanigra, Brassica napus, Brassica hirta, Brassica rapa, Brassicacampestris, Brassica carinata, and Brassica juncea), soybean (Glycinemax), castor bean (Ricinus communis), cotton, safflower (Carthamustinctorius), sunflower (Helianthus annuus), flax (Linum usitatissimum),corn (Zea mays), coconut (Cocos nucifera), palm (Elaeis guineensis), oilnut trees such as olive (Olea europaea), sesame, and peanut (Arachishipogaea), as well as Arabidopsis, tobacco, wheat, barley, oats,amaranth, potato, rice, tomato, and legumes (e.g., peas, beans, lentils,alfalfa, etc.).

The host cell can be prokaryotic. Examples of some prokaryotic organismsof the present disclosure include, but are not limited to, cyanobacteria(e.g., Synechococcus, Synechocystis, Athrospira, Anacytis, Anabaena,Nostoc, Spirulina, Fremyella, Gleocapsa, Oscillatoria, and,Pseudoanabaena). Suitable prokaryotic cells include, but are not limitedto, any of a variety of laboratory strains of Escherichia coli,Lactobacillus sp., Salmonella sp., and Shigella sp. (for example, asdescribed in Carrier et al. (1992) J. Immunol. 148:1176-1181; U.S. Pat.No. 6,447,784; and Sizemore et al. (1995) Science 270:299-302). Examplesof Salmonella strains which can be employed in the present disclosureinclude, but are not limited to, Salmonella typhi and S. typhimurium.Suitable Shigella strains include, but are not limited to, Shigellaflexneri, Shigella sonnei, and Shigella disenteriae. Typically, thelaboratory strain is one that is non-pathogenic. Non-limiting examplesof other suitable bacteria include, but are not limited to, Pseudomonaspudita, Pseudomonas aeruginosa, Pseudomonas mevalonii, Rhodobactersphaeroides, Rhodobacter capsulatus, Rhodospirillum rubrum, andRhodococcus sp.

In some embodiments, the host organism is eukaryotic (e.g. green algae,red algae, brown algae). In some embodiments, the algae is a greenalgae, for example, a Chlorophycean. The algae can be unicellular ormulticellular. Suitable eukaryotic host cells include, but are notlimited to, yeast cells, insect cells, plant cells, fungal cells, andalgal cells. Suitable eukaryotic host cells include, but are not limitedto, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichiakoclamae, Pichia membranaefaciens, Pichia opuntiae, Pichiathermotolerans, Pichia salictaria, Pichia guercuum, Pichia pijperi,Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomycescerevisiae, Saccharomyces sp., Hansenula polymorpha, Kluyveromyces sp.,Kluyveromyces lactis, Candida albicans, Aspergillus nidulans,Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, Chrysosporiumlucknowense, Fusarium sp., Fusarium gramineum, Fusarium venenatum,Neurospora crassa, and Chlamydomonas reinhardtii.

In some embodiments, eukaryotic microalgae, such as for example, aChlamydomonas, Volvacales, Dunaliella, Scenedesmus, Chlorella, orHematococcus species, are used in the disclosed methods. In otherembodiments, the host cell is Chlamydomonas reinhardtii, Dunaliellasalina, Haematococcus pluvialis, Nannochloropsis oceania, N. salina.Scenedesmus dimorphus, Chlorella spp., D. viridis, or D. tertiolecta.

In some instances the organism is a rhodophyte, chlorophyte,heterokontophyte, tribophyte, glaucophyte, chlorarachniophyte,euglenoid, haptophyte, cryptoronad, dinoflagellum, or phytoplankon.

In some instances a host organism is vascular and photosynthetic.Examples of vascular plants include, but are not limited to,angiosperms, gymnosperms, rhyniophytes, or other tracheophytes.

In some instances a host organism is non-vascular and photosynthetic. Asused herein, the term “non-vascular photosynthetic organism,” refers toany macroscopic or microscopic organism, including, but not limited to,algae, cyanobacteria and photosynthetic bacteria, which does not have avascular system such as that found in vascular plants. Examples ofnon-vascular photosynthetic organisms include bryophtyes, such asmarchantiophytes or anthocerotophytes. In some instances the organism isa cyanobacteria. In some instances, the organism is algae (e.g.,macroalgae or microalgae). The algae can be unicellular or multicellularalgae. For example, the microalgae Chlamydomonas reinhardtii may betransformed with a vector, or a linearized portion thereof, encoding oneor more proteins of interest (e.g., a protein involved in the isoprenoidbiosynthesis pathway).

Methods for algal transformation are described in U.S. ProvisionalPatent Application No. 60/142,091. The methods of the present disclosurecan be carried out using algae, for example, the microalga, C.reinhardtii. The use of microalgae to express a polypeptide or proteincomplex according to a method of the disclosure provides the advantagethat large populations of the microalgae can be grown, includingcommercially (Cyanotech Corp.; Kailua-Kona Hi.), thus allowing forproduction and, if desired, isolation of large amounts of a desiredproduct.

The vectors of the present disclosure may be capable of stable ortransient transformation of multiple photosynthetic organisms,including, but not limited to, photosynthetic bacteria (includingcyanobacteria), cyanophyta, prochlorophyta, rhodophyta, chlorophyta,pyrrophyta, heterokontophyta, tribophyta, glaucophyta,chlorarachniophytes, euglenophyta, euglenoids, haptophyta, chrysophyta(including diatoms), cryptophyta, cryptomonads, dinophyta,dinoflagellata, pyrmnesiophyta, bacillariophyta, xanthophyta,eustigmatophyta, raphidophyta, phaeophyta, and phytoplankton. Othervectors of the present disclosure are capable of stable or transienttransformation of, for example, C. reinhardtii, N. oceania, N salina, D.salina, H. pluvalis, S. dimorphus, D. viridis, or D. tertiolecta.

Examples of appropriate hosts, include but are not limited to: bacterialcells, such as E. coli, Streptomyces, Salmonella typhimurium; fungalcells, such as yeast; insect cells, such as Drosophila S2 and SpodopteraSf9; animal cells, such as CHO, COS or Bowes melanoma; adenoviruses; andplant cells. The selection of an appropriate host is deemed to be withinthe scope of those skilled in the art.

Polynucleotides selected and isolated as described herein are introducedinto a suitable host cell. A suitable host cell is any cell which iscapable of promoting recombination and/or reductive reassortment. Theselected polynucleotides can be, for example, in a vector which includesappropriate control sequences. The host cell can be, for example, ahigher eukaryotic cell, such as a mammalian cell, or a lower eukaryoticcell, such as a yeast cell, or the host cell can be a prokaryotic cell,such as a bacterial cell. Introduction of a construct (vector) into thehost cell can be effected by, for example, calcium phosphatetransfection, DEAE-Dextran mediated transfection, or electroporation.

Recombinant polypeptides, including protein complexes, can be expressedin plants, allowing for the production of crops of such plants and,therefore, the ability to conveniently produce large amounts of adesired product. Accordingly, the methods of the disclosure can bepracticed using any plant, including, for example, microalga andmacroalgae, (such as marine algae and seaweeds), as well as plants thatgrow in soil.

In one embodiment, the host cell is a plant. The term “plant” is usedbroadly herein to refer to a eukaryotic organism containing plastids,such as chloroplasts, and includes any such organism at any stage ofdevelopment, or to part of a plant, including a plant cutting, a plantcell, a plant cell culture, a plant organ, a plant seed, and a plantlet.A plant cell is the structural and physiological unit of the plant,comprising a protoplast and a cell wall. A plant cell can be in the formof an isolated single cell or a cultured cell, or can be part of higherorganized unit, for example, a plant tissue, plant organ, or plant.Thus, a plant cell can be a protoplast, a gamete producing cell, or acell or collection of cells that can regenerate into a whole plant. Assuch, a seed, which comprises multiple plant cells and is capable ofregenerating into a whole plant, is considered plant cell for purposesof this disclosure. A plant tissue or plant organ can be a seed,protoplast, callus, or any other groups of plant cells that is organizedinto a structural or functional unit. Particularly useful parts of aplant include harvestable parts and parts useful for propagation ofprogeny plants. A harvestable part of a plant can be any useful part ofa plant, for example, flowers, pollen, seedlings, tubers, leaves, stems,fruit, seeds, and roots. A part of a plant useful for propagationincludes, for example, seeds, fruits, cuttings, seedlings, tubers, androotstocks.

A method of the disclosure can generate a plant containing genomic DNA(for example, a nuclear and/or plastid genomic DNA) that is geneticallymodified to contain a stably integrated polynucleotide (for example, asdescribed in Hager and Bock, Appl. Microbiol. Biotechnol. 54:302-310,2000). Accordingly, the present disclosure further provides a transgenicplant, e.g. C. reinhardtii, which comprises one or more chloroplastscontaining a polynucleotide encoding one or more exogenous or endogenouspolypeptides, including polypeptides that can allow for secretion offuel products and/or fuel product precursors (e.g., isoprenoids, fattyacids, lipids, triglycerides). A photosynthetic organism of the presentdisclosure comprises at least one host cell that is modified togenerate, for example, a fuel product or a fuel product precursor.

Some of the host organisms useful in the disclosed embodiments are, forexample, are extremophiles, such as hyperthermophiles, psychrophiles,psychrotrophs, halophiles, barophiles and acidophiles. Some of the hostorganisms which may be used to practice the present disclosure arehalophilic (e.g., Dunaliella salina, D. viridis, or D. tertiolecta). Forexample, D. salina can grow in ocean water and salt lakes (for example,salinity from 30-300 parts per thousand) and high salinity media (e.g.,artificial seawater medium, seawater nutrient agar, brackish watermedium, and seawater medium). In some embodiments of the disclosure, ahost cell expressing a protein of the present disclosure can be grown ina liquid environment which is, for example, 0.1, 0.2, 0.3, 0.4, 0.5,0.6, 0.7, 0.8, 0.9, 1.0, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9,2.0, 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3.0, 3.1, 3.2, 3.3,3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 4.0, 4.1, 4.2, 4.3 molar or higherconcentrations of sodium chloride. One of skill in the art willrecognize that other salts (sodium salts, calcium salts, potassiumsalts, or other salts) may also be present in the liquid environments.

Where a halophilic organism is utilized for the present disclosure, itmay be transformed with any of the vectors described herein. Forexample, D. salina may be transformed with a vector which is capable ofinsertion into the chloroplast or nuclear genome and which containsnucleic acids which encode a protein (e.g., an FPP synthase or afusicoccadiene synthase). Transformed halophilic organisms may then begrown in high-saline environments (e.g., salt lakes, salt ponds, andhigh-saline media) to produce the products (e.g., lipids) of interest.Isolation of the products may involve removing a transformed organismfrom a high-saline environment prior to extracting the product from theorganism. In instances where the product is secreted into thesurrounding environment, it may be necessary to desalinate the liquidenvironment prior to any further processing of the product.

The present disclosure further provides compositions comprising agenetically modified host cell. A composition comprises a geneticallymodified host cell; and will in some embodiments comprise one or morefurther components, which components are selected based in part on theintended use of the genetically modified host cell. Suitable componentsinclude, but are not limited to, salts; buffers; stabilizers;protease-inhibiting agents; cell membrane- and/or cell wall-preservingcompounds, e.g., glycerol and dimethylsulfoxide; and nutritional mediaappropriate to the cell.

For the production of a protein, for example, an isoprenoid orisoprenoid precursor compound, a host cell can be, for example, one thatproduces, or has been genetically modified to produce, one or moreenzymes in a prenyl transferase pathway and/or a mevalonate pathwayand/or an isoprenoid biosynthetic pathway. In some embodiments, the hostcell is one that produces a substrate of a prenyl transferase,isoprenoid synthase or mevalonate pathway enzyme.

In some embodiments, a genetically modified host cell is a host cellthat comprises an endogenous mevalonate pathway and/or isoprenoidbiosynthetic pathway and/or prenyl transferase pathway. In otherembodiments, a genetically modified host cell is a host cell that doesnot normally produce mevalonate or IPP via a mevalonate pathway, or FPP,GPP or GGPP via a prenyl transferase pathway, but has been geneticallymodified with one or more polynucleotides comprising nucleotidesequences encoding one or more mevalonate pathway, isoprenoid synthasepathway or prenyl transferase pathway enzymes (for example, as describedin U.S. Patent Publication No. 2004/005678; U.S. Patent Publication No.2003/0148479; and Martin et al. (2003) Natl. Biotech. 21(7):796-802).

Culturing of Cells or Organisms

An organism may be grown under conditions which permit photosynthesis,however, this is not a requirement (e.g., a host organism may be grownin the absence of light). In some instances, the host organism may begenetically modified in such a way that its photosynthetic capability isdiminished or destroyed. In growth conditions where a host organism isnot capable of photosynthesis (e.g., because of the absence of lightand/or genetic modification), typically, the organism will be providedwith the necessary nutrients to support growth in the absence ofphotosynthesis. For example, a culture medium in (or on) which anorganism is grown, may be supplemented with any required nutrient,including an organic carbon source, nitrogen source, phosphorous source,vitamins, metals, lipids, nucleic acids, micronutrients, and/or anorganism-specific requirement. Organic carbon sources include any sourceof carbon which the host organism is able to metabolize including, butnot limited to, acetate, simple carbohydrates (e.g., glucose, sucrose,and lactose), complex carbohydrates (e.g., starch and glycogen),proteins, and lipids. One of skill in the art will recognize that notall organisms will be able to sufficiently metabolize a particularnutrient and that nutrient mixtures may need to be modified from oneorganism to another in order to provide the appropriate nutrient mix.

Optimal growth of organisms occurs usually at a temperature of about 20°C. to about 25° C., although some organisms can still grow at atemperature of up to about 35° C. Active growth is typically performedin liquid culture. If the organisms are grown in a liquid medium and areshaken or mixed, the density of the cells can be anywhere from about 1to 5×10⁸ cells/ml at the stationary phase. For example, the density ofthe cells at the stationary phase for Chlamydomonas sp. can be about 1to 5×10⁷ cells/ml; the density of the cells at the stationary phase forNannochloropsis sp, can be about 1 to 5×10⁸ cells/ml; the density of thecells at the stationary phase for Scenedesmus sp. can be about 1 to5×10⁷ cells/ml; and the density of the cells at the stationary phase forChlorella sp. can be about 1 to 5×10⁸ cells/ml. Exemplary cell densitiesat the stationary phase are as follows: Chlamydomonas sp. can be about1×10⁷ cells/ml; Nannochloropsis sp. can be about 1×10⁸ cells/ml;Scenedesmus sp. can be about 1×10⁷ cells/ml; and Chlorella sp. can beabout 1×10⁸ cells/ml. An exemplary growth rate may yield, for example, atwo to four fold increase in cells per day, depending on the growthconditions. In addition, doubling times for organisms can be, forexample, 5 hours to 30 hours. The organism can also be grown on solidmedia, for example, media containing about 1.5% agar, in plates or inslants.

One source of energy is fluorescent light that can be placed, forexample, at a distance of about 1 inch to about two feet from theorganism. Examples of types of fluorescent lights includes, for example,cool white and daylight. Bubbling with air or CO₂ improves the growthrate of the organism. Bubbling with CO₂ can be, for example, at 1% to 5%CO₂. If the lights are turned on and off at regular intervals (forexample, 12:12 or 14:10 hours of light:dark) the cells of some organismswill become synchronized.

Long term storage of organisms can be achieved by streaking them ontoplates, sealing the plates with, for example, Parafilm™, and placingthem in dim light at about 10° C. to about 18° C. Alternatively,organisms may be grown as streaks or stabs into agar tubes, capped, andstored at about 10° C. to about 18° C. Both methods allow for thestorage of the organisms for several months.

For longer storage, the organisms can be grown in liquid culture to midto late log phase and then supplemented with a penetratingcryoprotective agent like DMSO or MeOH, and stored at less than −130° C.An exemplary range of DMSO concentrations that can be used is 5 to 8%.An exemplary range of MeOH concentrations that can be used is 3 to 9%.

For longer Organisms can be grown on a defined minimal medium (forexample, high salt medium (HSM), modified artificial sea water medium(MASM), or F/2 medium) with light as the sole energy source. In otherinstances, the organism can be grown in a medium (for example, trisacetate phosphate (TAP) medium), and supplemented with an organic carbonsource. Organisms, such as algae, can grow naturally in fresh water ormarine water. Culture media for freshwater algae can be, for example,synthetic media, enriched media, soil water media, and solidified media,such as agar. Various culture media have been developed and used for theisolation and cultivation of fresh water algae and are described inWatanabe. M. W. (2005). Freshwater Culture Media. In R. A. Andersen(Ed.), Algal Culturing Techniques (pp. 13-20). Elsevier Academic Press.Culture media for marine algae can be, for example, artificial seawatermedia or natural seawater media. Guidelines for the preparation of mediaare described in Harrison, P. J. and Berges, J. A. (2005). MarineCulture Media. In R. A. Andersen (Ed.), Algal Culturing Techniques (pp.21-33). Elsevier Academic Press.

Organisms may be grown in outdoor open water, such as ponds, the ocean,seas, rivers, waterbeds, marshes, shallow pools, lakes, aqueducts, andreservoirs. When grown in water, the organism can be contained in ahalo-like object comprised of lego-like particles. The halo-like objectencircles the organism and allows it to retain nutrients from the waterbeneath while keeping it in open sunlight.

In some instances, organisms can be grown in containers wherein eachcontainer comprises one or two organisms, or a plurality of organisms.The containers can be configured to float on water. For example, acontainer can be filled by a combination of air and water to make thecontainer and the organism(s) in it buoyant. An organism that is adaptedto grow in fresh water can tins be grown in salt water (i.e., the ocean)and vice versa. This mechanism allows for automatic death of theorganism if there is any damage to the container.

Culturing techniques for algae are well know to one of skill in the artand are described, for example, in Freshwater Culture Media. In R. A.Andersen (Ed.), Algal Culturing Techniques. Elsevier Academic Press.

Because photosynthetic organisms, for example, algae, require sunlight,CO₂ and water for growth, they can be cultivated in, for example, openponds and lakes. However, these open systems are more vulnerable tocontamination than a closed system. One challenge with using an opensystem is that the organism of interest may not grow as quickly as apotential invader. This becomes a problem when another organism invadesthe liquid environment in which the organism of interest is growing, andthe invading organism has a faster growth rate and takes over thesystem.

In addition, in open systems there is less control over watertemperature, CO₂ concentration, and lighting conditions. The growingseason of the organism is largely dependent on location and, aside fromtropical areas, is limited to the warmer months of the year. Inaddition, in an open system, the number of different organisms that canbe grown is limited to those that are able to survive in the chosenlocation. An open system, however, is cheaper to set up and/or maintainthan a closed system.

Another approach to growing an organism is to use a semi-closed system,such as covering the pond or pool with a structure, for example, a“greenhouse-type” structure. While this can result in a smaller system,it addresses many of the problems associated with an open system. Theadvantages of a semi-closed system are that it can allow for a greaternumber of different organisms to be grown, it can allow for an organismto be dominant over an invading organism by allowing the organism ofinterest to out compete the invading organism for nutrients required forits growth, and it can extend the growing season for the organism. Forexample, if the system is heated, the organism can grow year round.

A variation of the pond system is an artificial pond, for example, araceway pond. In these ponds, the organism, water, and nutrientscirculate around a “racetrack.” Paddlewheels provide constant motion tothe liquid in the racetrack, allowing for the organism to be circulatedback to the surface of the liquid at a chosen frequency. Paddlewheelsalso provide a source of agitation and oxygenate the system. Theseraceway ponds can be enclosed, for example, in a building or agreenhouse, or can be located outdoors.

Raceway ponds are usually kept shallow because the organism needs to beexposed to sunlight, and sunlight can only penetrate the pond water to alimited depth. The depth of a raceway pond can be, for example, about 4to about 12 inches. In addition, the volume of liquid that can becontained in a raceway pond can be, for example, about 200 liters toabout 600,000 liters.

The raceway ponds can be operated in a continuous manner, with, forexample, CO₂ and nutrients being constantly fed to the ponds, whilewater containing the organism is removed at the other end.

If the raceway pond is placed outdoors, there are several different waysto address the invasion of an unwanted organism. For example, the pH orsalinity of the liquid in which the desired organism is in can be suchthat the invading organism either slows down its growth or dies.

Also, chemicals can be added to the liquid, such as bleach, or apesticide can be added to the liquid, such as glyphosate. In addition,the organism of interest can be genetically modified such that it isbetter suited to survive in the liquid environment. Any one or more ofthe above strategies can be used to address the invasion of an unwantedorganism.

Alternatively, organisms, such as algae, can be grown in closedstructures such as photobioreactors, where the environment is understricter control than in open systems or semi-closed systems. Aphotobioreactor is a bioreactor which incorporates some type of lightsource to provide photonic energy input into the reactor. The termphotobioreactor can refer to a system closed to the environment andhaving no direct exchange of gases and contaminants with theenvironment. A photobioreactor can be described as an enclosed,illuminated culture vessel designed for controlled biomass production ofprototrophic liquid cell suspension cultures. Examples ofphotobioreactors include, for example, glass containers, plastic tubes,tanks, plastic sleeves, and bags. Examples of light sources that can beused to provide the energy required to sustain photosynthesis include,for example, fluorescent bulbs, LEDs, and natural sunlight. Becausethese systems are closed everything that the organism needs to grow (forexample, carbon dioxide, nutrients, water, and light) must be introducedinto the bioreactor.

Photobioreactors, despite the costs to set up and maintain them, haveseveral advantages over open systems, they can, for example, prevent orminimize contamination, permit axenic organism cultivation ofmonocultures (a culture consisting of only one species of organism),offer better control over the culture conditions (for example, pH,light, carbon dioxide, and temperature), prevent water evaporation,lower carbon dioxide losses due to out gassing, and permit higher cellconcentrations.

On the other hand, certain requirements of photobioreactors, such ascooling, mixing, control of oxygen accumulation and biofouling, makethese systems more expensive to build and operate than open systems orsemi-closed systems.

Photobioreactors can be set up to be continually harvested (as is withthe majority of the larger volume cultivation systems), or harvested onebatch at a time (for example, as with polyethlyene bag cultivation). Abatch photobioreactor is set up with, for example, nutrients, anorganism (for example, algae), and water, and the organism is allowed togrow until the batch is harvested. A continuous photobioreactor can beharvested, for example, either continually, daily, or at fixed timeintervals.

High density photobioreactors are described in, for example, Lee, etal., Biotech. Bioengineering 44:1161-1167, 1994. Other types ofbioreactors, such as those for sewage and waste water treatments, aredescribed in, Sawayamra, et al., Appl. Micro. Biotech., 41:729-731,1994. Additional examples of photobioreactors are described in, U.S.Appl. Pub., No. 2005/0260553, U.S. Pat. Nos. 5,958,761, and 6,083,740.Also, organisms, such as algae may be mass-cultured for the removal ofheavy metals (for example, as described in Wilkinson, Biotech, Letters,11:861-864, 1989), hydrogen (for example, as described in U.S. PatentApplication Publication No. 2003/0162273), and pharmaceutical compoundsfrom a water, soil, or other source or sample. Organisms can also becultured in conventional fermentation bioreactors, which include, butare not limited to, batch, fed-batch, cell recycle, and continuousfermentors. Additional methods of culturing organisms and variations ofthe methods described herein are known to one of skill in the art.

Organisms can also be grown near ethanol production plants or otherfacilities or regions (e.g., cities and highways) generating CO₂. Assuch, the methods herein contemplate business methods for selling carboncredits to ethanol plants or other facilities or regions generating CO₂while making fuels or fuel products by growing one or more of theorganisms described herein near the ethanol production plant, facility,or region.

The organism of interest, grown in any of the systems described herein,can be, for example, continually harvested, or harvested one batch at atime.

CO₂ can be delivered to any of the systems described herein, forexample, by bubbling in CO₂ from under the surface of the liquidcontaining the organism. Also, sparges can be used to inject CO₂ intothe liquid. Spargers are, for example, porous disc or tube assembliesthat are also referred to as Bubblers, Carbonators, Aerators, PorousStones and Diffusers.

Nutrients that can be used in the systems described herein include, forexample, nitrogen (in the form of NO₃ ⁻ or NH₄ ⁺), phosphorus, and tracemetals (Fe, Mg, K, Ca, Co, Cu, Mn, Mo, Zn, V, and B). The nutrients cancome, for example, in a solid form or in a liquid form. If the nutrientsare in a solid form they can be mixed with, for example, fresh or saltwater prior to being delivered to the liquid containing the organism, orprior to being delivered to a photobioreactor.

Organisms can be grown in cultures, for example large scale cultures,where large scale cultures refers to growth of cultures in volumes ofgreater than about 6 liters, or greater than about 10 liters, or greaterthan about 20 liters. Large scale growth can also be growth of culturesin volumes of 50 liters or more, 100 liters or more, or 200 liters ormore. Large scale growth can be growth of cultures in, for example,ponds, containers, vessels, or other areas, where the pond, container,vessel, or area that contains the culture is for example, at lease 5square meters, at least 10 square meters, at least 200 square meters, atleast 500 square meters, at least 1,500 square meters, at least 2,500square meters, in area, or greater.

Chlamydomonas sp., Nannochloropsis sp., Scenedesmus sp., and Chlorellasp. are exemplary algae that can be cultured as described herein and cangrow under a wide array of conditions.

One organism that can be cultured as described herein is a commonly usedlaboratory species C. reinhardtii. Cells of this species are haploid,and can grow on a simple medium of inorganic salts, using photosynthesisto provide energy. This organism can also grow in total darkness ifacetate is provided as a carbon source. C. reinhardtii can be readilygrown at room temperature under standard fluorescent lights. Inaddition, the cells can be synchronized by placing them on a light-darkcycle. Other methods of culturing C. reinhardtii cells are known to oneof skill in the art.

Polynucleotides and Polypeptides

In addition to being genetically engineered to be, for example, salttolerant, herbicide resistant, or sodium hypochlorite resistant, ascompared to an unengineered organism, the host cells or host organismcan be further modified to express an exogenous or endogenous protein,for example, a protein involved in the isoprenoid biosynthetic pathwayor a protein involved in the accumulation and/or secretion of fattyacids, glycerol lipids, or oils.

Also provided are isolated polynucleotides encoding a protein, forexample, an FPP synthase, described herein. As used herein “isolatedpolynucleotide” means a polynucleotide that is free of one or both ofthe nucleotide sequences which flank the polynucleotide in thenaturally-occurring genome of the organism from which the polynucleotideis derived. The term includes, for example, a polynucleotide or fragmentthereof that is incorporated into a vector or expression cassette; intoan autonomously replicating plasmid or virus; into the genomic DNA of aprokaryote or eukaryote; or that exists as a separate moleculeindependent of other polynucleotides. It also includes a recombinantpolynucleotide that is part of a hybrid polynucleotide, for example, oneencoding a polypeptide sequence.

The proteins of the present disclosure can be made by any method knownin the art. The protein may be synthesized using either solid-phasepeptide synthesis or by classical solution peptide synthesis also knownas liquid-phase peptide synthesis. Using Val-Pro-Pro, Enalapril andLisinopril as starting templates, several series of peptide analogs suchas X-Pro-Pro, X-Ala-Pro, and X-Lys-Pro, wherein X represents any aminoacid residue, may be synthesized using solid-phase or liquid-phasepeptide synthesis. Methods for carrying out liquid phase synthesis oflibraries of peptides and oligonucleotides coupled to a solubleoligomeric support have also been described. Bayer, Ernst and Mutter,Manfred, Nature 237:512-513 (1972); Bayer, Ernst, et al., J. Am. Chem.Soc. 96:7333-7336 (1974); Bonora, Gian Maria, et al., Nucleic Acids Res.18:3155-3159 (1990). Liquid phase synthetic methods have the advantageover solid phase synthetic methods in that liquid phase synthesismethods do not require a structure present on a first reactant which issuitable for attaching the reactant to the solid phase. Also, liquidphase synthesis methods do not require avoiding chemical conditionswhich may cleave the bond between the solid phase and the first reactant(or intermediate product). In addition, reactions in a homogeneoussolution may give better yields and more complete reactions than thoseobtained in heterogeneous solid phase/liquid phase systems such as thosepresent in solid phase synthesis.

In oligomer-supported liquid phase synthesis the growing product isattached to a large soluble polymeric group. The product from each stepof the synthesis can then be separated from unreacted reactants based onthe large difference in size between the relatively largepolymer-attached product and the unreacted reactants. This permitsreactions to take place in homogeneous solutions, and eliminates tediouspurification steps associated with traditional liquid phase synthesis.Oligomer-supported liquid phase synthesis has also been adapted toautomatic liquid phase synthesis of peptides. Bayer, Ernst, et al.,Peptides: Chemistry, Structure, Biology, 426-432.

For solid-phase peptide synthesis, the procedure entails the sequentialassembly of the appropriate amino acids into a peptide of a desiredsequence while the end of the growing peptide is linked to an insolublesupport. Usually, the carboxyl terminus of the peptide is linked to apolymer from which it can be liberated upon treatment with a cleavagereagent. In a common method, an amino acid is bound to a resin particle,and the peptide generated in a stepwise manner by successive additionsof protected amino acids to produce a chain of amino acids.Modifications of the technique described by Merrifield are commonlyused. See, e.g., Merrifield, J. Am. Chem. Soc. 96: 2989-93 (1964). In anautomated solid-phase method, peptides are synthesized by loading thecarboxy-terminal amino acid onto an organic linker (e.g., PAM,4-oxymethylphenylacetamidomethyl), which is covalently attached to aninsoluble polystyrene resin cross-linked with divinyl benzene. Theterminal amine may be protected by blocking with t-butyloxycarbonyl.Hydroxyl- and carboxyl-groups are commonly protected by blocking withO-benzyl groups. Synthesis is accomplished in an automated peptidesynthesizer, such as that available from Applied Biosystems (FosterCity, Calif.). Following synthesis, the product may be removed from theresin. The blocking groups are removed by using hydrofluoric acid ortrifluoromethyl sulfonic acid according to established methods. Aroutine synthesis may produce 0.5 mmole of peptide resin. Followingcleavage and purification, a yield of approximately 60 to 70% istypically produced. Purification of the product peptides is accomplishedby, for example, crystallizing the peptide from an organic solvent suchas methyl-butyl ether, then dissolving in distilled water, and usingdialysis (if the molecular weight of the subject peptide is greater thanabout 500 daltons) or reverse high pressure liquid chromatography (e.g.,using a C¹⁸ column with 0.1% trifluoroacetic acid and acetonitrile assolvents) if the molecular weight of the peptide is less than 500daltons. Purified peptide may be lyophilized and stored in a dry stateuntil use. Analysis of the resulting peptides may be accomplished usingthe common methods of analytical high pressure liquid chromatography(HPLC) and electrospray mass spectrometry (ES-MS).

In other cases, a protein, for example, a protein involved in theisoprenoid biosynthesis pathway or in fatty acid synthesis, is producedby recombinant methods. For production of any of the proteins describedherein, host cells transformed with an expression vector containing thepolynucleotide encoding such a protein can be used. The host cell can bea higher eukaryotic cell, such as a mammalian cell, or a lowereukaryotic cell such as a yeast or algal cell, or the host can be aprokaryotic cell such as a bacterial cell. Introduction of theexpression vector into the host cell can be accomplished by a variety ofmethods including calcium phosphate transfection, DEAE-dextran mediatedtransfection, polybrene, protoplast fusion, liposomes, directmicroinjection into the nuclei, scrape loading, biolistic transformationand electroporation. Large scale production of proteins from recombinantorganisms is a well established process practiced on a commercial scaleand well within the capabilities of one skilled in the art.

It should be recognized that the present disclosure is not limited totransgenic cells, organisms, and plastids containing a protein orproteins as disclosed herein, but also encompasses such cells,organisms, and plastids transformed with additional nucleotide sequencesencoding enzymes involved in fatty acid synthesis. Thus, someembodiments involve the introduction of one or more sequences encodingproteins involved in fatty acid synthesis in addition to a proteindisclosed herein. For example, several enzymes in a fatty acidproduction pathway may be linked, either directly or indirectly, suchthat products produced by one enzyme in the pathway, once produced, arein close proximity to the next enzyme in the pathway. These additionalsequences may be contained in a single vector either operatively linkedto a single promoter or linked to multiple promoters, e.g. one promoterfor each sequence. Alternatively, the additional coding sequences may becontained in a plurality of additional vectors. When a plurality ofvectors are used, they can be introduced into the host cell or organismsimultaneously or sequentially.

Additional embodiments provide a plastid, and in particular achloroplast, transformed with a polynucleotide encoding a protein of thepresent disclosure. The protein may be introduced into the genome of theplastid using any of the methods described herein or otherwise known inthe art. The plastid may be contained in the organism in which itnaturally occurs. Alternatively, the plastid may be an isolated plastid,that is, a plastid that has been removed from the cell in which itnormally occurs. Methods for the isolation of plastids are known in theart and can be found, for example, in Maliga et al., Methods in PlantMolecular Biology, Cold Spring Harbor Laboratory Press, 1995; Gupta andSingh, J. Biosci., 21:819 (1996); and Camara et al. Plant Physiol.,73:94 (1983). The isolated plastid transformed with a protein of thepresent disclosure can be introduced into a host cell. The host cell canbe one that naturally contains the plastid or one in which the plastidis not naturally found.

Also within the scope of the present disclosure are artificial plastidgenomes, for example chloroplast genomes, that contain nucleotidesequences encoding any one or more of the proteins of the presentdisclosure. Methods for the assembly of artificial plastid genomes canbe found in co-pending U.S. patent application Ser. No. 12/287,230 filedOct. 6, 2008, published as U.S. Publication No. 2009/0123977 on May 14,2009, and U.S. patent application Ser. No. 12/384,893 filed Apr. 8,2009, published as U.S. Publication No. 2009/0269816 on Oct. 29, 2009,each of which is incorporated by reference in its entirety.

Introduction of Polynucleotide into a Host Organism or Cell

To generate a genetically modified host cell, a polynucleotide, or apolynucleotide cloned into a vector, is introduced stably or transientlyinto a host cell, using established techniques, including, but notlimited to, electroporation, calcium phosphate precipitation,DEAE-dextran mediated transfection, and liposome-mediated transfection.For transformation, a polynucleotide of the present disclosure willgenerally further include a selectable marker, e.g., any of severalwell-known selectable markers such as neomycin resistance, ampicillinresistance, tetracycline resistance, chloramphenicol resistance, andkanamycin resistance.

A polynucleotide or recombinant nucleic acid molecule described herein,can be introduced into a cell (e.g., alga cell) using any method knownin the art. A polynucleotide can be introduced into a cell by a varietyof methods, which are well known in the art and selected, in part, basedon the particular host cell. For example, the polynucleotide can beintroduced into a cell using a direct gene transfer method such aselectroporation or microprojectile mediated (biolistic) transformationusing a particle gun, or the “glass bead method,” or by pollen-mediatedtransformation, liposome-mediated transformation, transformation usingwounded or enzyme-degraded immature embryos, or wounded orenzyme-degraded embryogenic callus (for example, as described inPotrykus, Ann. Rev. Plant. Physiol. Plant Mol. Biol. 42:205-225, 1991).

As discussed above, microprojectile mediated transformation can be usedto introduce a polynucleotide into a cell (for example, as described inKlein et al., Nature 327:70-73, 1987). This method utilizesmicroprojectiles such as gold or tungsten, which are coated with thedesired polynucleotide by precipitation with calcium chloride,spermidine or polyethylene glycol. The microprojectile particles areaccelerated at high speed into a cell using a device such as theBIOLISTIC PD-1000 particle gun (BioRad; Hercules Calif.). Methods forthe transformation using biolistic methods are well known in the art(for example, as described in Christou, Trend in Plant Science1:423-431, 1996). Microprojectile mediated transformation has been used,for example, to generate a variety of transgenic plant species,including cotton, tobacco, corn, hybrid poplar and papaya. Importantcereal crops such as wheat, oat, barley, sorghum and rice also have beentransformed using microprojectile mediated delivery (for example, asdescribed in Duan et al., Nature Biotech. 14:494-498, 1996; andShimamoto, Curr. Opin. Biotech. 5:158-162, 1994). The transformation ofmost dicotyledonous plants is possible with the methods described above.Transformation of monocotyledonous plants also can be transformed using,for example, biolistic methods as described above, protoplasttransformation, electroporation of partially permeabilized cells,introduction of DNA using glass fibers, and the glass bead agitationmethod.

The basic techniques used for transformation and expression inphotosynthetic microorganisms are similar to those commonly used for E.coli, Saccharomyces cerevisiae and other species, Transformation methodscustomized for a photosynthetic microorganisms, e.g., the chloroplast ofa strain of algae, are known in the art. These methods have beendescribed in a number of texts for standard molecular biologicalmanipulation (see Packer & Glaser, 1988, “Cyanobacteria”, Meth.Enzymol., Vol. 167; Weissbach & Weissbach, 1988, “Methods for plantmolecular biology,” Academic Press, New York, Sambrook, Fritsch &Maniatis, 1989, “Molecular Cloning: A laboratory manual,” 2nd editionCold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; and ClarkM S, 1997, Plant Molecular Biology, Springer, N.Y.). These methodsinclude, for example, biolistic devices (See, for example, Sanford,Trends In Biotech. (1988) δ: 299-302, U.S. Pat. No. 4,945,050;electroporation (Fromm et al., Proc. Nat'l. Acad. Sci. (USA) (1985) 82:5824-5828); use of a laser beam, electroporation, microinjection or anyother method capable of introducing DNA into a host cell.

Plastid transformation is a routine and well known method forintroducing a polynucleotide into a plant cell chloroplast (see U.S.Pat. Nos. 5,451,513, 5,545,817, and 5,545,818; WO 95/16783; McBride etal., Proc. Natl. Acad. Sci., USA 91:7301-7305, 1994). In someembodiments, chloroplast transformation involves introducing regions ofchloroplast DNA flanking a desired nucleotide sequence, allowing forhomologous recombination of the exogenous DNA into the targetchloroplast genome. In some instances one to 1.5 kb flanking nucleotidesequences of chloroplast genomic DNA may be used. Using this method,point mutations in the chloroplast 16S rRNA and rps12 genes, whichconfer resistance to spectinomycin and streptomycin, can be utilized asselectable markers for transformation (Svab et al., Proc. Natl. Acad.Sci., USA 87:8526-8530, 1990), and can result in stable homoplasmictransformants, at a frequency of approximately one per 100 bombardmentsof target leaves.

A further refinement in chloroplast transformation/expression technologythat facilitates control over the timing and tissue pattern ofexpression of introduced DNA coding sequences in plant plastid genomeshas been described in PCT International Publication WO 95/16783 and U.S.Pat. No. 5,576,198. This method involves the introduction into plantcells of constructs for nuclear transformation that provide for theexpression of a viral single subunit RNA polymerase and targeting ofthis polymerase into the plastids via fusion to a plastid transitpeptide. Transformation of plastids with DNA constructs comprising aviral single subunit RNA polymerase-specific promoter specific to theRNA polymerase expressed from the nuclear expression constructs operablylinked to DNA coding sequences of interest permits control of theplastid expression constructs in a tissue and/or developmental specificmanner in plants comprising both the nuclear polymerase construct andthe plastid expression constructs. Expression of the nuclear RNApolymerase coding sequence can be placed under the control of either aconstitutive promoter, or a tissue- or developmental stage-specificpromoter, thereby extending this control to the plastid expressionconstruct responsive to the plastid-targeted, nuclear-encoded viral RNApolymerase.

When nuclear transformation is utilized, the protein can be modified forplastid targeting by employing plant cell nuclear transformationconstructs wherein DNA coding sequences of interest are fused to any ofthe available transit peptide sequences capable of facilitatingtransport of the encoded enzymes into plant plastids, and drivingexpression by employing an appropriate promoter. Targeting of theprotein can be achieved by fusing DNA encoding plastid, e.g.,chloroplast, leucoplast, amyloplast, etc., transit peptide sequences tothe 5′ end of DNAs encoding the enzymes. The sequences that encode atransit peptide region can be obtained, for example, from plantnuclear-encoded plastid proteins, such as the small subunit (SSU) ofribulose bisphosphate carboxylase, EPSP synthase, plant fatty acidbiosynthesis related genes including fatty acyl-ACP thioesterases, acylcarrier protein (ACP), stearoyl-ACP desaturase, β-ketoacyl-ACP synthaseand acyl-ACP thioesterase, or LHCPII genes, etc. Plastid transit peptidesequences can also be obtained from nucleic acid sequences encodingcarotenoid biosynthetic enzymes, such as GGPP synthase, phytoenesynthase, and phytoene desaturase. Other transit peptide sequences aredisclosed in Von Heijne et al. (1991) Plant Mol. Biol. Rep. 9: 104;Clark et al. (1989) J. Biol. Chem. 264: 17544; della-Cioppa et al.(1987) Plant Physiol. 84: 965; Romer et al. (1993) Biochem. Biophys.Res. Commun. 196: 1414; and Shah et al. (1986) Science 233: 478. Anothertransit peptide sequence is that of the intact ACCase from Chlamydomonas(genbank EDO96563, amino acids 1-33). The encoding sequence for atransit peptide effective in transport to plastids can include all or aportion of the encoding sequence for a particular transit peptide, andmay also contain portions of the mature protein encoding sequenceassociated with a particular transit peptide. Numerous examples oftransit peptides that can be used to deliver target proteins intoplastids exist, and the particular transit peptide encoding sequencesuseful in the present disclosure are not critical as long as deliveryinto a plastid is obtained. Proteolytic processing within the plastidthen produces the mature enzyme. This technique has proven successfulwith enzymes involved in polyhydroxyalkanoate biosynthesis (Nawrath etal. (1994) Proc. Natl. Acad. Sci. USA 91: 12760), and neomycinphosphotransferase II (NPT-II) and CP4 EPSPS (Padgette et al. (1995)Crop Sci. 35: 1451), for example.

Of interest are transit peptide sequences derived from enzymes known tobe imported into the leucoplasts of seeds. Examples of enzymescontaining useful transit peptides include those related to lipidbiosynthesis (e.g., subunits of the plastid-targeted dicot acetyl-CoAcarboxylase, biotin carboxylase, biotin carboxyl carrier protein,α-carboxy-transferase, and plastid-targeted monocot multifunctionalacetyl-CoA carboxylase (Mw, 220,000); plastidic subunits of the fattyacid synthase complex (e.g., acyl carrier protein (ACP), malonyl-ACPsynthase, KASI, KASII, and KASIII); steroyl-ACP desaturase;thioesterases (specific for short, medium, and long chain acyl ACP);plastid-targeted acyl transferases (e.g., glycerol-3-phosphate and acyltransferase); enzymes involved in the biosynthesis of aspartate familyamino acids; phytoene synthase; gibberellic acid biosynthesis (e.g.,ent-kaurene synthases 1 and 2); and carotenoid biosynthesis (e.g.,lycopene synthase).

Nuclear transformation of eukaryotic algal cells can be bymicroprojectile mediated transformation, or can be by protoplasttransformation, electroporation, introduction of DNA using glass fibers,or the glass bead agitation method, as nonlimiting examples (Kindle,Proc. Natl. Acad. Sciences USA 87: 1228-1232 (1990); Shimogawara et al.Genetics 148: 1821-1828 (1998)). Markers for nuclear transformation ofalgae include, without limitation, markers for rescuing auxotrophicstrains (e.g., NIT1 and ARG7 in Chlamydomonas; Kindle et al. J. CellBiol. 109: 2589-2601 (1989), Debuchy et al. EMBO J. 8: 2803-2809(1989)), as well as dominant selectable markers (e.g., CRY1, aada;Nelson et al. Mol. Cellular. Biol. 14: 4011-4019 (1994), Cerutti et al.Genetics 145: 97-110 (1997)). In some embodiments, the presence of theknock out is used as a selectable marker for transformants. A knock outsequence can in some embodiments be co-transformed with a secondsequence encoding a protein to be produced by the alga (for example, atherapeutic protein, industrial enzyme) or a protein that promotes orenhances production of a commercial, therapeutic, or nutritionalproduct. The second sequence is in some embodiments provided on the samenucleic acid construct as the knock out sequence for transformation intothe alga, in which the success of the knock out sequence in activatingthe gene of interest is used as the selectable marker.

In some embodiments, an alga is transformed with a nucleic acid whichencodes a protein of interest, for example, a prenyl transferase, anisoprenoid synthase, or an enzyme capable of converting a precursor intoa fuel product or a precursor of a fuel product (e.g., an isoprenoid orfatty acid).

In one embodiment, a transformation may introduce a nucleic acid into aplastid of the host alga (e.g., chloroplast). In another embodiments atransformation may introduce a nucleic acid into the nuclear genome ofthe host alga. In still another embodiment, a transformation mayintroduce nucleic acids into both the nuclear genome and into a plastid.

Transformed cells can be plated on selective media followingintroduction of exogenous nucleic acids. This method may also compriseseveral steps for screening. A screen of primary transformants can beconducted to determine which clones have proper insertion of theexogenous nucleic acids. Clones which show the proper integration may bepropagated and re-screened to ensure genetic stability. Such methodologyensures that the transformants contain the genes of interest. In manyinstances, such screening is performed by polymerase chain reaction(PCR); however, any other appropriate technique known in the art may beutilized. Many different methods of PCR are known in the art (e.g.,nested PCR, real time PCR). For any given screen, one of skill in theart will recognize that PCR components may be varied to achieve optimalscreening results. For example, magnesium concentration may need to beadjusted upwards when PCR is performed on disrupted alga cells to which(which chelates magnesium) is added to chelate toxic metals. Followingthe screening for clones with the proper integration of exogenousnucleic acids, clones can be screened for the presence of the encodedprotein(s) and/or products. Protein expression screening can beperformed by Western blot analysis and/or enzyme activity assays.Transporter and/or product screening may be performed by any methodknown in the art, for example ATP turnover assay, substrate transportassay, HPLC or gas chromatography.

The expression of the protein or enzyme can be accomplished by insertinga polynucleotide sequence (gene) encoding the protein or enzyme into thechloroplast or nuclear genome of a microalgae. The modified strain ofmicroalgae can be made homoplasmic to ensure that the polynucleotidewill be stably maintained in the chloroplast genome of all descendents.A microalga is homoplasmic for a gene when the inserted gene is presentin all copies of the chloroplast genome, for example, it is apparent toone of skill in the art that a chloroplast may contain multiple copiesof its genome, and therefore, the term “homoplasmic” or “homoplasmy”refers to the state where all copies of a particular locus of interestare substantially identical. Plastid expression, in which genes areinserted by homologous recombination into all of the several thousandcopies of the circular plastid genome present in each plant cell, takesadvantage of the enormous copy number advantage over nuclear-expressedgenes to permit expression levels that can readily exceed 10% or more ofthe total soluble plant protein. The process of determining the plasmicstate of an organism of the present disclosure involves screeningtransformants for the presence of exogenous nucleic acids and theabsence of wild-type nucleic acids at a given locus of interest.

Vectors

Construct, vector and plasmid are used interchangeably throughout thedisclosure. Nucleic acids encoding the proteins described herein, can becontained in vectors, including cloning and expression vectors. Acloning vector is a self-replicating DNA molecule that serves totransfer a DNA segment into a host cell. Three common types of cloningvectors are bacterial plasmids, phages, and other viruses. An expressionvector is a cloning vector designed so that a coding sequence insertedat a particular site will be transcribed and translated into a protein.Both cloning and expression vectors can contain nucleotide sequencesthat allow the vectors to replicate in one or more suitable host cells.In cloning vectors, this sequence is generally one that enables thevector to replicate independently of the host cell chromosomes, and alsoincludes either origins of replication or autonomously replicatingsequences.

In some embodiments, a polynucleotide of the present disclosure iscloned or inserted into an expression vector using cloning techniquesknow to one of skill in the art. The nucleotide sequences may beinserted into a vector by a variety of methods. In the most commonmethod the sequences are inserted into an appropriate restrictionendonuclease site(s) using procedures commonly known to those skilled inthe art and detailed in, for example, Sambrook et al., MolecularCloning, A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, (1989)and Ausubel et al., Short Protocols in Molecular Biology, 2nd Ed., JohnWiley & Sons (1992).

Suitable expression vectors include, but are not limited to, baculovirusvectors, bacteriophage vectors, plasmids, phagemids, cosmids, fosmids,bacterial artificial chromosomes, viral vectors (e.g. viral vectorsbased on vaccinia virus, poliovirus, adenovirus, adeno-associated virus,SV40, and herpes simplex virus), PI-based artificial chromosomes, yeastplasmids, yeast artificial chromosomes, and any other vectors specificfor specific hosts of interest (such as E coli and yeast). Thus, forexample, a polynucleotide encoding an FPP synthase, can be inserted intoanyone of a variety of expression vectors that are capable of expressingthe enzyme. Such vectors can include, for example, chromosomal,nonchromosomal and synthetic DNA sequences.

Suitable expression vectors include chromosomal, non-chromosomal andsynthetic DNA sequences, for example, SV 40 derivatives; bacterialplasmids; phage DNA; baculovirus; yeast plasmids; vectors derived fromcombinations of plasmids and phage DNA; and viral DNA such as vaccinia,adenovirus, fowl pox virus, and pseudorabies. In addition, any othervector that is replicable and viable in the host may be used. Forexample, vectors such as Ble2A, Arg7/2A, and SEnuc357 can be used forthe expression of a protein.

Numerous suitable expression vectors are known to those of skill in theart. The following vectors are provided by way of example; for bacterialhost cells: pQE vectors (Qiagen), pBluescript plasmids, pNH vectors,lambda-ZAP vectors (Stratagene), pTrc99a, pKK223-3, pDR540, and pRIT2T(Pharmacia); for eukaryotic host cells: pXT1, pSG5 (Stratagene), pSVK3,pBPV, pMSG, pET21a-d(+) vectors (Novagen), and pSVLSV40 (Pharmacia).However, any other plasmid or other vector may be used so long as it iscompatible with the host cell.

The expression vector, or a linearized portion thereof, can encode oneor more exogenous or endogenous nucleotide sequences. Examples ofexogenous nucleotide sequences that can be transformed into a hostinclude genes from bacteria, fungi, plants, photosynthetic bacteria orother algae. Examples of other types of nucleotide sequences that can betransformed into a host, include, but are not limited to, transportergenes, isoprenoid producing genes, genes which encode for proteins whichproduce isoprenoids with two phosphates (e.g., GPP synthase and/or FPPsynthase), genes which encode for proteins which produce fatty acids,lipids, or triglycerides, for example, ACCases, endogenous promoters,and 5′ UTRs from the psbA, atpA, or rbcL genes. In some instances, anexogenous sequence is flanked by two homologous sequences.

Homologous sequences are, for example, those that have at least 50%, atleast 60%, at least 70%, at least 80%, at least 90%, at least 95%, atleast 98%, or at least at least 99% sequence identity to a referenceamino acid sequence or nucleotide sequence, for example, the amino acidsequence or nucleotide sequence that is found naturally in the hostcell. The first and second homologous sequences enable recombination ofthe exogenous or endogenous sequence into the genome of the hostorganism. The first and second homologous sequences can be at least 100,at least 200, at least 300, at least 400, at least 500, or at least 1500nucleotides in length.

The polynucleotide sequence may comprise nucleotide sequences that arecodon biased for expression in the organism being transformed. Theskilled artisan is well aware of the “codon-bias” exhibited by aspecific host cell in usage of nucleotide codons to specify a givenamino acid. Without being bound by theory, by using a host cell'spreferred codons, the rate of translation may be greater. Therefore,when synthesizing a gene for improved expression in a host cell, it maybe desirable to design the gene such that its frequency of codon usageapproaches the frequency of preferred codon usage of the host cell. Insome organisms, codon bias differs between the nuclear genome andorganelle genomes, thus, codon optimization or biasing may be performedfor the target genome (e.g., nuclear codon biased or chloroplast codonbiased). In some embodiments, codon biasing occurs before mutagenesis togenerate a polypeptide. In other embodiments, codon biasing occurs aftermutagenesis to generate a polynucleotide. In yet other embodiments,codon biasing occurs before mutagenesis as well as after mutagenesis.Codon bias is described in detail herein.

In some embodiments, a vector comprises a polynucleotide operably linkedto one or more control elements, such as a promoter and/or atranscription terminator. A nucleic acid sequence is operably linkedwhen it is placed into a functional relationship with another nucleicacid sequence. For example, DNA for a presequence or secretory leader isoperatively linked to DNA for a polypeptide if it is expressed as apreprotein which participates in the secretion of the polypeptide; apromoter is operably linked to a coding sequence if it affects thetranscription of the sequence; or a ribosome binding site is operablylinked to a coding sequence if it is positioned so as to facilitatetranslation. Generally, operably linked sequences are contiguous and, inthe case of a secretory leader, contiguous and in reading phase. Linkingis achieved by ligation at restriction enzyme sites. If suitablerestriction sites are not available, then synthetic oligonucleotideadapters or linkers can be used as is known to those skilled in the art.Sambrook et al., Molecular Cloning, A Laboratory Manual, 2^(nd) Ed.,Cold Spring Harbor Press, (1989) and Ausubel et al., Short Protocols inMolecular Biology, 2^(nd) Ed., John Wiley & Sons (1992).

A vector in some embodiments provides for amplification of the copynumber of a polynucleotide. A vector can be, for example, an expressionvector that provides for expression of an ACCase, a prenyl transferase,an isoprenoid synthase, or a mevalonate synthesis enzyme in a host cell,e.g., a prokaryotic host cell or a eukaryotic host cell.

A polynucleotide or polynucleotides can be contained in a vector orvectors. For example, where a second (or more) nucleic acid molecule isdesired, the second nucleic acid molecule can be contained in a vector,which can, but need not be, the same vector as that containing the firstnucleic acid molecule. The vector can be any vector useful forintroducing a polynucleotide into a genome and can include a nucleotidesequence of genomic DNA (e.g., nuclear or plastid) that is sufficient toundergo homologous recombination with genomic DNA, for example, anucleotide sequence comprising about 400 to about 1500 or moresubstantially contiguous nucleotides of genomic DNA,

A regulatory or control element, as the term is used herein, broadlyrefers to a nucleotide sequence that regulates the transcription ortranslation of a polynucleotide or the localization of a polypeptide towhich it is operatively linked. Examples include, but are not limitedto, an RBS, a promoter, enhancer, transcription terminator, aninitiation (start) codon, a splicing signal for intron excision andmaintenance of a correct reading frame, a STOP codon, an amber or ochrecodon, and an IRES. A regulatory element can include a promoter andtranscriptional and translational stop signals. Elements may be providedwith linkers for the purpose of introducing specific restriction sitesfacilitating ligation of the control sequences with the coding region ofa nucleotide sequence encoding a polypeptide. Additionally, a sequencecomprising a cell compartmentalization signal (i.e., a sequence thattargets a polypeptide to the cytosol, nucleus, chloroplast membrane orcell membrane) can be attached to the polynucleotide encoding a proteinof interest. Such signals are well known in the art and have been widelyreported (see, e.g., U.S. Pat. No. 5,776,689).

Promoters are untranslated sequences located generally 100 to 1000 basepairs (bp) upstream from the start codon of a structural gene thatregulate the transcription and translation of nucleic acid sequencesunder their control.

Promoters useful for the present disclosure may come from any source(e.g., viral, bacterial, fungal, protist, and animal). The promoterscontemplated herein can be specific to photosynthetic organisms,non-vascular photosynthetic organisms, and vascular photosyntheticorganisms (e.g., algae, flowering plants). In some instances, thenucleic acids above are inserted into a vector that comprises a promoterof a photosynthetic organism, e.g., algae. The promoter can be aconstitutive promoter or an inducible promoter. A promoter typicallyincludes necessary nucleic acid sequences near the start site oftranscription, (e.g., a TATA element). Common promoters used inexpression vectors include, but are not limited to, LTR or SV40promoter, the E. coli lac or trp promoters, and the phage lambda PLpromoter. Other promoters known to control the expression of genes inprokaryotic or eukaryotic cells can be used and are known to thoseskilled in the art. Expression vectors may also contain a ribosomebinding site for translation initiation, and a transcription terminator.The vector may also contain sequences useful for the amplification ofgene expression.

A “constitutive” promoter is a promoter that is active under mostenvironmental and developmental conditions. An “inducible” promoter is apromoter that is active under controllable environmental ordevelopmental conditions. Examples of inducible promoters/regulatoryelements include, for example, a nitrate-inducible promoter (forexample, as described in Bock et al, Plant Mol. Biol. 17:9 (1991)), or alight-inducible promoter, (for example, as described in Feinbaum et al,Mol. Gen. Genet. 226:449 (1991); and Lam and Chua, Science 248:471(1990)), or a heat responsive promoter (for example, as described inMuller et al., Gene 111: 165-73 (1992)).

In many embodiments, a polynucleotide of the present disclosure includesa nucleotide sequence encoding a protein or enzyme of the presentdisclosure, where the nucleotide sequence encoding the polypeptide isoperably linked to an inducible promoter. Inducible promoters are wellknown in the art. Suitable inducible promoters include, but are notlimited to, the pL of bacteriophage λ; Placo; Ptrp; Ptac (Ptrp-lachybrid promoter); an isopropyl-beta-D-thiogalactopyranoside(IPTG)-inducible promoter, e.g., a lacZ promoter; atetracycline-inducible promoter; an arabinose inducible promoter, e.g.,P_(BAD) (for example, as described in Guzman et al. (1995) J. Bacteriol.177:4121-4130); a xylose-inducible promoter, e.g., Pxy1 (for example, asdescribed in Kim et al. (1996) Gene 181:71-76); a GAL1 promoter; atryptophan promoter; a lac promoter; an alcohol-inducible promoter,e.g., a methanol-inducible promoter, an ethanol-inducible promoter; araffinose-inducible promoter; and a heat-inducible promoter, e.g., heatinducible lambda P_(L) promoter and a promoter controlled by aheat-sensitive repressor (e.g., C1857-repressed lambda-based expressionvectors; for example, as described in Hoffmann et al. (1999) FEMSMicrobiol Lett. 177(2):327-34).

In many embodiments, a polynucleotide of the present disclosure includesa nucleotide sequence encoding a protein or enzyme of the presentdisclosure, where the nucleotide sequence encoding the polypeptide isoperably linked to a constitutive promoter. Suitable constitutivepromoters for use in prokaryotic cells are known in the art and include,but are not limited to, a sigma70 promoter, and a consensus sigma70promoter.

Suitable promoters for use in prokaryotic host cells include, but arenot limited to, a bacteriophage T7 RNA polymerase promoter; a trppromoter; a lac operon promoter; a hybrid promoter, e.g., a lac/lachybrid promoter, a tac/trc hybrid promoter, a trp/lac promoter, a T7/lacpromoter; a trc promoter; a tac promoter; an araBAD promoter; in vivoregulated promoters, such as an ssaG promoter or a related promoter (forexample, as described in U.S. Patent Publication No. 20040131637), apagC promoter (for example, as described in Pulkkinen and Miller, J.Bacteriol., 1991: 173(1): 86-93; and Alpuche-Aranda et al., PNAS, 1992;89(21): 10079-83), a nirB promoter (for example, as described inHarborne et al. (1992) Mol. Micro, 6:2805-2813; Dunstan et al. (1999)Infect. Immun. 67:5133-5141; McKelvie et al. (2004) Vaccine22:3243-3255; and Chatfield et al. (1992) Biotechnol. 10:888-892); asigma70 promoter, e.g., a consensus sigma70 promoter (for example,GenBank Accession Nos. AX798980, AX798961, and AX798183); a stationaryphase promoter, e.g., a dps promoter, an spy promoter; a promoterderived from the pathogenicity island SPI-2 (for example, as describedin WO96/17951); an actA promoter (for example, as described inShetron-Rama et al. (2002) Infect. Immun. 70:1087-1096); an rpsMpromoter (for example, as described in Valdivia and Falkow (1996). Mol.Microbiol. 22:367-378); a tet promoter (for example, as described inHillen, W. and Wissmann, A. (1989) In Saenger, W. and Heinemann, U.(eds), Topics in Molecular and Structural Biology, Protein-Nucleic AcidInteraction. Macmillan, London, UK, Vol. 10, pp. 143-162); and an SP6promoter (for example, as described in Melton et al. (1984) Nucl. AcidsRes. 12:7035-7056).

In yeast, a number of vectors containing constitutive or induciblepromoters may be used. For a review of such vectors see, CurrentProtocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel, et al.,Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant, et al.,1987, Expression and Secretion Vectors for Yeast, in Methods inEnzymology, Eds. Wu & Grossman, 31987, Acad. Press, N.Y., Vol. 153, pp.516-544; Glover, 1986, DNA Cloning, Vol. II, IRL Press, Wash., D.C., Ch.3; Bitter, 1987, Heterologous Gene Expression in Yeast, Methods inEnzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, pp.673-684; and The Molecular Biology of the Yeast Saccharomyces, 1982,Eds. Strathern et al., Cold Spring Harbor Press, Vols. I and II. Aconstitutive yeast promoter such as ADH or LEU2 or an inducible promotersuch as GAL may be used (for example, as described in Cloning in Yeast,Ch. 3, R. Rothstein In: DNA Cloning Vol. 11, A Practical Approach, Ed.DM Glover, 1986, IRL Press, Wash., D.C.). Alternatively, vectors may beused which promote integration of foreign DNA sequences into the yeastchromosome.

Non-limiting examples of suitable eukaryotic promoters include CMVimmediate early, HSV thymidine kinase, early and late SV40, LTRs fromretrovirus, and mouse metallothionein-1. Selection of the appropriatevector and promoter is well within the level of ordinary skill in theart. The expression vector may also contain a ribosome binding site fortranslation initiation and a transcription terminator. The expressionvector may also include appropriate sequences for amplifying expression.

A vector utilized in the practice of the disclosure also can contain oneor more additional nucleotide sequences that confer desirablecharacteristics on the vector, including, for example, sequences such ascloning sites that facilitate manipulation of the vector, regulatoryelements that direct replication of the vector or transcription ofnucleotide sequences contain therein, and sequences that encode aselectable marker. As such, the vector can contain, for example, one ormore cloning sites such as a multiple cloning site, which can, but neednot, be positioned such that a exogenous or endogenous polynucleotidecan be inserted into the vector and operatively linked to a desiredelement.

The vector also can contain a prokaryote origin of replication (ori),for example, an E. coli ori or a cosmid ori, thus allowing passage ofthe vector into a prokaryote host cell, as well as into a plantchloroplast. Various bacterial and viral origins of replication are wellknown to those skilled in the art and include, but are not limited tothe pBR322 plasmid origin, the 2u plasmid origin, and the SV40, polyoma,adenovirus, VSV, and BPV viral origins.

A regulatory or control element, as the term is used herein, broadlyrefers to a nucleotide sequence that regulates the transcription ortranslation of a polynucleotide or the localization of a polypeptide towhich it is operatively linked. Examples include, but are not limitedto, an RBS, a promoter, enhancer, transcription terminator, aninitiation (start) codon, a splicing signal for intron excision andmaintenance of a correct reading frame, a STOP codon, an amber or ochrecodon, an IRES. Additionally, an element can be a cellcompartmentalization signal (i.e., a sequence that targets a polypeptideto the cytosol, nucleus, chloroplast membrane or cell membrane). In someaspects of the present disclosure, a cell compartmentalization signal(e.g., a cell membrane targeting sequence) may be ligated to a geneand/or transcript, such that translation of the gene occurs in thechloroplast. In other aspects, a cell compartmentalization signal may beligated to a gene such that, following translation of the gene, theprotein is transported to the cell membrane. Cell compartmentalizationsignals are well known in the art and have been widely reported (see,e.g., U.S. Pat. No. 5,776,689).

A vector, or a linearized portion thereof, may include a nucleotidesequence encoding a reporter polypeptide or other selectable marker. Theterm “reporter” or “selectable marker” refers to a polynucleotide (orencoded polypeptide) that confers a detectable phenotype. A reportergenerally encodes a detectable polypeptide, for example, a greenfluorescent protein or an enzyme such as luciferase, which, whencontacted with an appropriate agent (a particular wavelength of light orluciferin, respectively) generates a signal that can be detected by eyeor using appropriate instrumentation (for example, as described inGiacomin, Plant Sci. 116:59-72, 1996; Scikantha, J. Bacteriol. 178:121,1996; Gerdes, FEBS Lett. 389:44-47, 1996; and Jefferson. EMBO J.6:3901-3907, 1997, fl-glucuronidase). A selectable marker generally is amolecule that, when present or expressed in a cell, provides a selectiveadvantage (or disadvantage) to the cell containing the marker, forexample, the ability to grow in the presence of an agent that otherwisewould kill the cell.

A selectable marker can provide a means to obtain, for example,prokaryotic cells, eukaryotic cells, and/or plant cells that express themarker and, therefore, can be useful as a component of a vector of thedisclosure. The selection gene or marker can encode for a proteinnecessary for the survival or growth of the host cell transformed withthe vector. One class of selectable markers are native or modified geneswhich restore a biological or physiological function to a host cell(e.g., restores photosynthetic capability or restores a metabolicpathway). Other examples of selectable markers include, but are notlimited to, those that confer antimetabolite resistance, for example,dihydrofolate reductase, which confers resistance to methotrexate (forexample, as described in Reiss, Plant Physiol. (Life Sci. Adv.)13:143-149, 1994); neomycin phosphotransferase, which confers resistanceto the aminoglycosides neomycin, kanamycin and paromycin (for example,as described in Herrera-Estrella, EMBO J. 2:987-995, 1983), hygro, whichconfers resistance to hygromycin (for example, as described in Marsh,Gene 32:481-485, 1984), trpB, which allows cells to utilize indole inplace of tryptophan; hisD, which allows cells to utilize histinol inplace of histidine (for example, as described in Hartman, Proc. Natl.Acad. Sci., USA 85:8047, 1988); mannose-6-phosphate isomerase whichallows cells to utilize mannose (for example, as described in PCTPublication Application No. WO 94/20627); ornithine decarboxylase, whichconfers resistance to the ornithine decarboxylase inhibitor,2-(difluoromethyl)-DL-ornithine (DFMO; for example, as described inMcConlogue, 1987, In: Current Communications in Molecular Biology, ColdSpring Harbor Laboratory ed.); and deaminase from Aspergillus terreus,which confers resistance to Blasticidin S (for example, as described inTamura, Biosci. Biotechnol. Biochem. 59:2336-2338, 1995). Additionalselectable markers include those that confer herbicide resistance, forexample, phosphinothricin acetyltransferase gene, which confersresistance to phosphinothricin (for example, as described in White etal., Nucl. Acids Res. 18:1062, 1990; and Spencer et al., Theor. Appl.Genet. 79:625-631, 1990), a mutant EPSPV-synthase, which confersglyphosate resistance (for example, as described in Hinchee et al.,BioTechnology 91:915-922, 1998), a mutant acetolactate synthase, whichconfers imidazolione or sulfonylurea resistance (for example, asdescribed in Lee et al., EMBO J. 7:1241-1248, 1988), a mutant psbA,which confers resistance to atrazine (for example, as described in Smedaet al., Plant Physiol. 103:911-917, 1993), or a mutantprotoporphyrinogen oxidase (for example, as described in U.S. Pat. No.5,767,373), or other markers conferring resistance to an herbicide suchas glufosinate. Selectable markers include polynucleotides that conferdihydrofolate reductase (DHFR) or neomycin resistance for eukaryoticcells; tetramycin or ampicillin resistance for prokaryotes such as E.coli; and bleomycin, gentamycin, glyphosate, hygromycin, kanamycin,methotrexate, phleomycin, phosphinotricin, spectinomycin, dtreptomycin,streptomycin, sulfonamide and sulfonylurea resistance in plants (forexample, as described in Maliga et al., Methods in Plant MolecularBiology, Cold Spring Harbor Laboratory Press, 1995, page 39). Theselection marker can have its own promoter or its expression can bedriven by a promoter driving the expression of a polypeptide ofinterest.

Reporter genes greatly enhance the ability to monitor gene expression ina number of biological organisms. Reporter genes have been successfullyused in chloroplasts of higher plants, and high levels of recombinantprotein expression have been reported. In addition, reporter genes havebeen used in the chloroplast of C. reinhardtii. In chloroplasts ofhigher plants, β-glucuronidase (uidA, for example, as described in Stauband Maliga, EMBO J. 12:601-606, 1993), neomycin phosphotransferase(nptII, for example, as described in Carrer et al., Mol. Gen. Genet.241:49-56, 1993), adenosyl-3-adenyltransf-erase (aadA, for example, asdescribed in Svab and Maliga, Proc. Natl. Acad. Sci., USA 90:913-917,1993), and the Aequorea victoria GFP (for example, as described inSidorov et al., Plant J. 19:209-216, 1999) have been used as reportergenes (for example, as described in Heifetz, Biochemie 82:655-666,2000). Each of these genes has attributes that make them usefulreporters of chloroplast gene expression, such as ease of analysis,sensitivity, or the ability to examine expression in situ. Based uponthese studies, other exogenous proteins have been expressed in thechloroplasts of higher plants such as Bacillus thuringiensis Cry toxins,conferring resistance to insect herbivores (for example, as described inKota et al., Proc. Natl. Acad. Sci., USA 96:1840-1845, 1999), or humansomatotropin (for example, as described in Staub et al., Nat.Biotechnol. 18:333-338, 2000), a potential biopharmaceutical. Severalreporter genes have been expressed in the chloroplast of the eukaryoticgreen alga, C. reinhardtii, including aadA (for example, as described inGoldschmidt-Clermont. Nucl. Acids Res. 19:4083-4089 1991; and Zerges andRochaix, Mol. Cell Biol. 14:5268-5277, 1994), uidA (for example, asdescribed in Sakamoto et al., Proc. Natl. Acad. Sci., USA 90:477-501,1993; and Ishikura et al., J. Biosci. Bioeng. 87:307-314 1999), Renillaluciferase (for example, as described in Minko et al., Mol. Gen. Genet.262:421-425, 1999) and the amino glycoside phosphotransferase fromAcinetobacter baumanii, aphA6 (for example, as described in Bateman andPurton, Mol. Gen. Genet 263:404-410, 2000). In one embodiment theprotein described herein is modified by the addition of an N-terminalstrep tag epitope to add in the detection of protein expression.

In some instances, the vectors of the present disclosure will containelements such as an E. coli or S. cerevisiae origin of replication. Suchfeatures, combined with appropriate selectable markers, allows for thevector to be “shuttled” between the target host cell and a bacterialand/or yeast cell. The ability to passage a shuttle vector of thedisclosure in a secondary host may allow for more convenientmanipulation of the features of the vector. For example, a reactionmixture containing the vector and inserted polynucleotide(s) of interestcan be transformed into prokaryote host cells such as E. coli, amplifiedand collected using routine methods, and examined to identify vectorscontaining an insert or construct of interest. If desired, the vectorcan be further manipulated, for example, by performing site directedmutagenesis of the inserted polynucleotide, then again amplifying andselecting vectors having a mutated polynucleotide of interest. A shuttlevector then can be introduced into plant cell chloroplasts, wherein apolypeptide of interest can be expressed and, if desired, isolatedaccording to a method of the disclosure.

Knowledge of the chloroplast or nuclear genome of the host organism, forexample, C. reinhardtii, is useful in the construction of vectors foruse in the disclosed embodiments. Chloroplast vectors and methods forselecting regions of a chloroplast genome for use as a vector are wellknown (see, for example, Bock, J. Mol. Biol. 312:425-438, 2001; Stauband Maliga, Plant Cell 4:39-45, 1992; and Kavanagh et al., Genetics152:1111-1122, 1999, each of which is incorporated herein by reference).The entire chloroplast genome of C. reinhardtii is available to thepublic on the world wide web, at the URL“biology.duke.edu/chlamy_genome/-chloro.html” (see “view complete genomeas text file” link and “maps of the chloroplast genome” link; J. Maul,J. W. Lilly, and D. B. Stern, unpublished results; revised Jan. 28,2002; to be published as GenBank Ace. No. AF396929; and Maul, J. E., etal. (2002) The Plant Cell, Vol. 14 (2659-2679)). Generally, thenucleotide sequence of the chloroplast genomic DNA that is selected foruse is not a portion of a gene, including a regulatory sequence orcoding sequence. For example, the selected sequence is not a gene thatif disrupted, due to the homologous recombination event, would produce adeleterious effect with respect to the chloroplast. For example, adeleterious effect on the replication of the chloroplast genome or to aplant cell containing the chloroplast. In this respect, the websitecontaining the C. reinhardtii chloroplast genome sequence also providesmaps showing coding and non-coding regions of the chloroplast genome,thus facilitating selection of a sequence useful for constructing avector (also described in Maul, J. E., et al. (2002) The Plant Cell,Vol. 14 (2659-2679)). For example, the chloroplast vector, p322, is aclone extending from the Eco (Eco RI) site at about position 143.1 kb tothe Xho (Xho I) site at about position 148.5 kb (see, world wide web, atthe URL “biology.duke.edu/chlamy_genome/chloro.html”, and clicking on“maps of the chloroplast genome” link, and “140-150 kb” link; alsoaccessible directly on world wide web at URL“biology.duke.edu/chlam-y/chloro/chlorol40.html”).

In addition, the entire nuclear genome of C. reinhardtii is described inMerchant, S. S., et al., Science (2007), 318(5848):245-250, thusfacilitating one of skill in the art to select a sequence or sequencesuseful for constructing a vector.

For expression of the polypeptide in a host, an expression cassette orvector may be employed. The expression vector will provide atranscriptional and translational initiation region, which may beinducible or constitutive, where the coding region is operably linkedunder the transcriptional control of the transcriptional initiationregion, and a transcriptional and translational termination region.These control regions may be native to the gene, or may be derived froman exogenous source. Expression vectors generally have convenientrestriction sites located near the promoter sequence to provide for theinsertion of nucleic acid sequences encoding exogenous or endogenousproteins. A selectable marker operative in the expression host may bepresent.

The nucleotide sequences may be inserted into a vector by a variety ofmethods. In the most common method the sequences are inserted into anappropriate restriction endonuclease site(s) using procedures commonlyknown to those skilled in the art and detailed in, for example, Sambrooket al., Molecular Cloning, A Laboratory Manual, 2^(nd) Ed., Cold SpringHarbor Press, (1989) and Ausubel et al., Short Protocols in MolecularBiology, 2^(nd) Ed., John Wiley & Sons (1992).

The description herein provides that host cells may be transformed withvectors. One of skill in the art will recognize that such transformationincludes transformation with circular or linearized vectors, orlinearized portions of a vector. Thus, a host cell comprising a vectormay contain the entire vector in the cell (in either circular or linearform), or may contain a linearized portion of a vector of the presentdisclosure. In some instances 0.5 to 1.5 kb flanking nucleotidesequences of chloroplast genomic DNA may be used. In some instances 0.5to 1.5 kb flanking nucleotide sequences of nuclear genomic DNA may beused, or 2.0 to 5.0 kb may be used.

Compounds

The modified or transformed host organism disclosed herein is useful inthe production of a desired biomolecule, compound, composition, orproduct; these terms can be used interchangeably. The present disclosureprovides methods of producing, for example, an isoprenoid or isoprenoidprecursor compound in a host cell. One such method involves, culturing amodified host cell in a suitable culture medium under conditions thatpromote synthesis of a product, for example, an isoprenoid compound orisoprenoid precursor compound, where the isoprenoid compound isgenerated by the expression of an enzyme of the present disclosure,wherein the enzyme uses a substrate present in the host cell. In someembodiments, a method further comprises isolating the isoprenoidcompound from the cell and/or from the culture medium.

In some embodiments, the product (e.g. fuel molecule) is collected byharvesting the liquid medium. As some fuel molecules (e.g.,monoterpenes) are immiscible in water, they would float to the surfaceof the liquid medium and could be extracted easily, for example byskimming. In other instances, the fuel molecules can be extracted fromthe liquid medium. In still other instances, the fuel molecules arevolatile. In such instances, impermeable barriers can cover or otherwisesurround the growth environment and can be extracted from the air withinthe barrier. For some fuel molecules, the product may be extracted fromboth the environment (e.g., liquid environment and/or air) and from theintact host cells. Typically, the organism would be harvested at anappropriate point and the product may then be extracted from theorganism. The collection of cells may be by any means known in the art,including, but not limited to concentrating cells, mechanical orchemical disruption of cells, and purification of product(s) from cellcultures and/or cell lysates. Cells and/or organisms can be grown andthen the product(s) collected by any means known to one of skill in theart. One method of extracting the product is by harvesting the host cellor a group of host cells and then drying the cell(s). The product(s)from the dried host cell(s) are then harvested by crushing the cells toexpose the product. In some instances, the product may be producedwithout killing the organisms. Producing and/or expressing the productmay not render the organism unviable.

In some embodiments, a genetically modified host cell is cultured in asuitable medium (e.g., Luria-Bertoni broth, optionally supplemented withone or more additional agents, such as an inducer (e.g., where theisoprenoid synthase is under the control of an inducible promoter); andthe culture medium is overlaid with an organic solvent, e.g. dodecane,forming an organic layer. The compound produced by the geneticallymodified host partitions into the organic layer, from which it can thenbe purified. In some embodiments, where, for example, a prenyltransferase, isoprenoid synthase or mevalonate synthesis-encodingnucleotide sequence is operably linked to an inducible promoter, aninducer is added to the culture medium; and, after a suitable time, thecompound is isolated from the organic layer overlaid on the culturemedium.

In some embodiments, the compound or product, for example, an isoprenoidcompound will be separated from other products which may be present inthe organic layer. Separation of the compound from other products thatmay be present in the organic layer is readily achieved using, e.g.,standard chromatographic techniques.

Methods of culturing the host cells, separating products, and isolatingthe desired product or products are known to one of skill in the art andare discussed further herein.

In some embodiments, the compound, for example, an isoprenoid orisoprenoid compound is produced in a genetically modified host cell at alevel that is at least about 2-fold, at least about 5-fold, at leastabout 10-fold, at least about 25-fold, at least about 50-fold, at leastabout 100-fold, at least about 500-fold, at least about 1000-fold, atleast about 2000-fold, at least about 3000-fold, at least about4000-fold, at least about 5000-fold, or at least about 10,000-fold, ormore, higher than the level of the isoprenoid or isoprenoid precursorcompound produced in an unmodified host cell that produces theisoprenoid or isoprenoid precursor compound via the same biosyntheticpathway.

In some embodiments, the compound, for example, an isoprenoid compoundis pure, e.g., at least about 40% pure, at least about 50% pure, atleast about 60% pure, at least about 70% pure, at least about 80% pure,at least about 90% pure, at least about 95% pure, at least about 98%, ormore than 98% pure. “Pure” in the context of an isoprenoid compoundrefers to an isoprenoid compound that is free from other isoprenoidcompounds, portions of compounds, contaminants, and unwanted byproducts,for example.

Examples of products contemplated herein include hydrocarbon productsand hydrocarbon derivative products. A hydrocarbon product is one thatconsists of only hydrogen molecules and carbon molecules. A hydrocarbonderivative product is a hydrocarbon product with one or moreheteroatoms, wherein the heteroatom is any atom that is not hydrogen orcarbon. Examples of heteroatoms include, but are not limited to,nitrogen, oxygen, sulfur, and phosphorus. Some products can behydrocarbon-rich, wherein, for example, at least 50%, at least 60%, atleast 70%, at least 80%, at least 90%, or at least 95% of the product byweight is made up of carbon and hydrogen.

One exemplary group of hydrocarbon products are isoprenoids. Isoprenoids(including terpenoids) are derived from isoprene subunits, but aremodified, for example, by the addition of heteroatoms such as oxygen, bycarbon skeleton rearrangement, and by alkylation. Isoprenoids generallyhave a number of carbon atoms which is evenly divisible by five, butthis is not a requirement as “irregular” terpenoids are known to one ofskill in the art, Carotenoids, such as carotenes and xanthophylls, areexamples of isoprenoids that are useful products. A steroid is anexample of a terpenoid. Examples of isoprenoids include, but are notlimited to, hemiterpenes (C5), monoterpenes (C10), sesquiterpenes (C15),diterpenes (C20), triterpenes (C30), tetraterpenes (C40), polyterpenes(C_(n), wherein “n” is equal to or greater than 45), and theirderivatives. Other examples of isoprenoids include, but are not limitedto, limonene, 1,8-cineole, α-pinene, camphene, (+)-sabinene, myrcene,abietadiene, taxadiene, farnesyl pyrophosphate, fusicoccadiene,amorphadiene, (E)-α-bisabolene, zingiberene, or diapophytoene, and theirderivatives.

Products, for example fuel products, comprising hydrocarbons, may beprecursors or products conventionally derived from crude oil, orpetroleum, such as, but not limited to, liquid petroleum gas, naptha(ligroin), gasoline, kerosene, diesel, lubricating oil, heavy gas, coke,asphalt, tar, and waxes.

Useful products include, but are not limited to, terpenes and terpenoidsas described above. An exemplary group of terpenes are diterpenes (C20),Diterpenes are hydrocarbons that can be modified (e.g. oxidized, methylgroups removed, or cyclized); the carbon skeleton of a diterpene can berearranged, to form, for example, terpenoids, such as fusicoccadiene.Fusicoccadiene may also be formed, for example, directly from theisoprene precursors, without being bound by the availability ofditerpene or GGDP. Genetic modification of organisms, such as algae, bythe methods described herein, can lead to the production offusicoccadiene, for example, and other types of terpenes, such aslimonene, for example. Genetic modification can also lead to theproduction of modified terpenes, such as methyl squalene or hydroxylatedand/or conjugated terpenes such as paclitaxel.

Other useful products can be, for example, a product comprising ahydrocarbon obtained from an organism expressing a diterpene synthase.Such exemplary products include ent-kaurene, casbene, andfusicoccadiene, and may also include fuel additives.

In some embodiments, a product (such as a fuel product) contemplatedherein comprises one or more carbons derived from an inorganic carbonsource. In some embodiments, at least 10%, at least 20%, at least 30%,at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, atleast 90%, at least 95%, or at least 99% of the carbons of a product asdescribed herein are derived from an inorganic carbon source. Examplesof inorganic carbon sources include, but are not limited to, carbondioxide, carbonate, bicarbonate, and carbonic acid. The product can be,for example, an organic molecule with carbons from an inorganic carbonsource that were fixed during photosynthesis.

The products produced by the present disclosure may be naturally, ornon-naturally (e.g., as a result of transformation) produced by the hostcell(s) and/or organism(s) transformed. For example, products notnaturally produced by algae may include non-native terpenes/terpenoidssuch as fusicoccadiene or limonene. A product naturally produced inalgae may be a terpene such as a carotenoid (for example,beta-carotene). The host cell may be genetically modified, for example,by transformation of the cell with a sequence encoding a protein,wherein expression of the protein results in the secretion of anaturally or a non-naturally produced product or products. The productmay be a molecule not found in nature.

Examples of products include petrochemical products, precursors ofpetrochemical products, fuel products, petroleum products, precursors ofpetroleum products, and all other substances that may be useful in thepetrochemical industry. The product may be used for generatingsubstances, or materials, useful in the petrochemical industry. Theproducts may be used in a combustor such as a boiler, kiln, dryer orfurnace. Other examples of combustors are internal combustion enginessuch as vehicle engines or generators, including gasoline engines,diesel engines, jet engines, and other types of engines. In oneembodiment, a method herein comprises combusting a refined or “upgraded”composition. For example, combusting a refined composition can compriseinserting the refined composition into a combustion engine, such as anautomobile engine or a jet engine. Products described herein may also beused to produce plastics, resins, fibers, elastomers, pharmaceuticals,neutraceuticals, lubricants, and gels, for example.

Useful products can also include isoprenoid precursors. Isoprenoidprecursors are generated by one of two pathways; the mevalonate pathwayor the methylerythritol phosphate (MEP) pathway. Both pathways generatedimethylallyl pyrophosphate (DMAPP) and isopentyl pyrophosphate (IPP),the common C5 precursor for isoprenoids. The DMAPP and IPP are condensedto form geranyl-diphosphate (GPP), or other precursors, such asfarnesyl-diphosphate (FPP) or geranylgeranyl-diphosphate (GGPP), fromwhich higher isoprenoids are formed.

Useful products can also include small alkanes (for example, 1 toapproximately 4 carbons) such as methane, ethane, propane, or butane,which may be used for heating (such as in cooking) or making plastics.Products may also include molecules with a carbon backbone ofapproximately 5 to approximately 9 carbon atoms, such as naptha orligroin, or their precursors. Other products may include molecules witha carbon background of about 5 to about 12 carbon atoms, or cycloalkanesused as gasoline or motor fuel. Molecules and aromatics of approximately10 to approximately 18 carbons, such as kerosene, or its precursors, mayalso be useful as products. Other products include lubricating oil,heavy gas oil, or fuel oil, or their precursors, and can containalkanes, cycloalkanes, or aromatics of approximately 12 to approximately70 carbons. Products also include other residuals that can be derivedfrom or found in crude oil, such as coke, asphalt, tar, and waxes,generally containing multiple rings with about 70 or more carbons, andtheir precursors.

Modified organisms can be grown, in some embodiments in the presence ofCO₂, to produce a desired polypeptide. In some embodiments, the productsproduced by the modified organism are isolated or collected. Collectedproducts, such as terpenes and terpenoids, may then be further modified,for example, by refining and/or cracking to produce fuel molecules orcomponents.

The various products may be further refined to a final product for anend user by a number of processes. Refining can, for example, occur byfractional distillation. For example, a mixture of products, such as amix of different hydrocarbons with various chain lengths may beseparated into various components by fractional distillation.

Refining may also include any one or more of the following steps,cracking, unifying, or altering the product. Large products, such aslarge hydrocarbons (e.g. ≧C10), may be broken down into smallerfragments by cracking. Cracking may be performed by heat or highpressure, such as by steam, visbreaking, or coking. Products may also berefined by visbreaking, for example by thermally cracking largehydrocarbon molecules in the product by heating the product in afurnace. Refining may also include coking, wherein a heavy, almost purecarbon residue is produced. Cracking may also be performed by catalyticmeans to enhance the rate of the cracking reaction by using catalystssuch as, but not limited to, zeolite, aluminum hydrosilicate, bauxite,or silica-alumina. Catalysis may be by fluid catalytic cracking, wherebya hot catalyst, such as zeolite, is used to catalyze cracking reactions.Catalysis may also be performed by hydrocracking, where lowertemperatures are generally used in comparison to fluid catalyticcracking. Hydrocracking can occur in the presence of elevated partialpressure of hydrogen gas. Products may be refined by catalytic crackingto generate diesel, gasoline, and/or kerosene,

The products may also be refined by combining them in a unificationstep, for example by using catalysts, such as platinum or aplatinum-rhenium mix. The unification process can produce hydrogen gas,a by-product, which may be used in cracking.

The products may also be refined by altering, rearranging, orrestructuring hydrocarbons into smaller molecules. There are a number ofchemical reactions that occur in catalytic reforming processes which areknown to one of ordinary skill in the arts, Catalytic reforming can beperformed in the presence of a catalyst and a high partial pressure ofhydrogen. One common process is alkylation. For example, propylene andbutylene are mixed with a catalyst such as hydrofluoric acid or sulfuricacid, and the resulting products are high octane hydrocarbons, which canbe used to reduce knocking in gasoline blends.

The products may also be blended or combined into mixtures to obtain anend product. For example, the products may be blended to form gasolineof various grades, gasoline with or without additives, lubricating oilsof various weights and grades, kerosene of various grades, jet fuel,diesel fuel, heating oil, and chemicals for making plastics and otherpolymers. Compositions of the products described herein may be combinedor blended with fuel products produced by other means.

Some products produced from the host cells of the disclosure, especiallyafter refining, will be identical to existing petrochemicals, i.e.contain the same chemical structure. For instance, crude oil containsthe isoprenoid pristane, which is thought to be a breakdown product ofphytol, which is a component of chlorophyll. Some of the products maynot be the same as existing petrochemicals. However, although a moleculemay not exist in conventional petrochemicals or refining, it may stillbe useful in these industries. For example, a hydrocarbon could beproduced that is in the boiling point range of gasoline, and that couldbe used as gasoline or an additive, even though the hydrocarbon does notnormally occur in gasoline.

A product herein can be described by its Carbon Isotope Distribution(CID). At the molecular level, a CID is the statistical likelihood of asingle carbon atom within a molecule to be one of the naturallyoccurring carbon isotopes (for example, ¹²C, ¹³C, or ¹⁴C). At the bulklevel of a product, a CID may be the relative abundance of naturallyoccurring carbon isotopes (for example, ¹²C, ¹³C, or ¹⁴C) in a compoundcontaining at least one carbon atom. It is noted that the CID of afossil fuel may differ based on its source. For example, with CID(fos),the CID of carbon in a fossil fuel, such as petroleum, natural gas, andcoal is distinguishable from the CID(atm), the CID of carbon in currentatmospheric carbon dioxide. Additionally, the CID(photo-atm) refers tothe CID of a carbon-based compound made by photosynthesis in recenthistory where the source of inorganic carbon was carbon dioxide in theatmosphere. Also, CID(photo-fos) refers to the CID of a carbon basedcompound made by photosynthesis in recent history where the source ofsubstantially all of the inorganic carbon was carbon dioxide produced bythe burning of fossil fuels (for example, coal, natural gas, and/orpetroleum). The exact distribution is also a characteristic of 1) thetype of photosynthetic organism that produced the molecule, and 2) thesource of inorganic carbon. These isotope distributions can be used todefine the composition of photosynthetically-derived fuel products.Carbon isotopes are unevenly distributed among and within differentcompounds and the isotopic distribution can reveal information about thephysical, chemical, and metabolic processes involved in carbontransformation. The overall abundance of ¹³C relative to ¹²C in aphotosynthetic organism is often less than the overall abundance of ¹³Crelative to ¹²C in atmospheric carbon dioxide, indicating that carbonisotope discrimation occurs in the incorporation of carbon dioxide intophotosynthetic biomass.

A product, either before or after refining, can be identical to anexisting petrochemical. Some of the fuel products may not be the same asexisting petrochemicals. In one embodiment, a fuel product is similar toan existing petrochemical, except for the carbon isotope distribution.For example, it is believed that no fossil fuel petrochemicals have aδ¹³C distribution of less than −32%, whereas fuel products as describedherein can have a δ¹³C distribution of less than −32%, less than −35%,less than −40%, less than −45%, less than −50%, less than −55%, or lessthan −60%. In another embodiment, a fuel product or composition issimilar but not the same as an existing fossil fuel petrochemical andhas a δ¹³C distribution of less than −32%, less than −35% less than−40%, less than −45%, less than −50%, less than −55% or less than −60%.

A fuel product can be a composition comprising, for example, hydrogenand carbon molecules, wherein the hydrogen and carbon molecules are atleast about 80% of the atomic weight of the composition, and wherein theδ¹³C distribution of the composition is less than about −32%. For somefuel products described herein, the hydrogen and carbon molecules are atleast 90% of the atomic weight of the composition. For example, abiodiesel or fatty acid methyl ester (which has less than 90% hydrogenand carbon molecules by weight) may not be part of the composition. Instill other compositions, the hydrogen and carbon molecules are at least95 or at least 99% of the atomic weight of the composition. In yet othercompositions, the hydrogen and carbon molecules are 100% of the atomicweight of the composition. In some embodiments, the composition is aliquid. In other embodiments, the composition is a fuel additive or afuel product.

Also described herein is a fuel product comprising a compositioncomprising: hydrogen and carbon molecules, wherein the hydrogen andcarbon molecules are at least 80% of the atomic weight of thecomposition, and wherein the δ¹³C distribution of the composition isless than −32%; and a fuel component. In some embodiments, the δ¹³Cdistribution of the composition is less than about −35%, less than about−40%, less than about −45%, less than about −50%, less than about −55%,or less than about −60%. In some embodiments, the fuel component of thecomposition is a blending fuel, for example, a fossil fuel, gasoline,diesel, ethanol, jet fuel, or any combination thereof. In still otherembodiments, the blending fuel has a δ¹³C distribution of greater than−32%. For some fuel products described herein, the fuel component is afuel additive which may be MTBE, an anti-oxidant, an antistatic agent, acorrosion inhibitor, or any combination thereof. A fuel product asdescribed herein may be a product generated by blending a fuel productas described and a fuel component. In some embodiments, the fuel producthas a δ¹³C distribution of greater than −32%. In other embodiments, thefuel product has a δ¹³C distribution of less than −32%. For example, anoil composition extracted from an organism can be blended with a fuelcomponent prior to refining (for example, cracking) in order to generatea fuel product as described herein. A fuel component, can be a fossilfuel, or a mixing blend for generating a fuel product. For example, amixture for fuel blending may be a hydrocarbon mixture that is suitablefor blending with another hydrocarbon mixture to generate a fuelproduct. For example, a mixture of light alkanes may not have a certainoctane number to be suitable for a type of fuel, however, it can beblended with a high octane mixture to generate a fuel product. Inanother example, a composition with a δ¹³C distribution of less than−32% is blended with a hydrocarbon mixture for fuel blending to create afuel product. In some embodiments, the composition or fuel componentalone are not suitable as a fuel product, however, when combined, theyare useful as a fuel product. In other embodiments, either thecomposition or the fuel component or both individually are suitable as afuel product., in yet another embodiment, the fuel component is anexisting petroleum product, such as gasoline or jet fuel. In otherembodiments, the fuel component is derived from a renewable resource,such as bioethanol, biodiesel, and biogasoline.

Oil compositions, derived from biomass obtained from a host cell, can beused for producing high-octane hydrocarbon products. Thus, oneembodiment describes a method of forming a fuel product, comprising:obtaining an upgraded oil composition, cracking the oil composition, andblending the resulting one or more light hydrocarbons, having 4 to 12carbons and an Octane number of 80 or higher, with a hydrocarbon havingan Octane number of 80 or less. The hydrocarbons having an Octane numberof 80 or less are, for example, fossil fuels derived from refining crudeoil.

The biomass feedstock obtained from a host organism can be modified ortagged such that the light hydrocarbon products can be identified ortraced back to their original feedstock. For example, carbon isotopescan be introduced into a biomass hydrocarbon in the course of itsbiosynthesis. The tagged hydrocarbon feedstock can be subjected to therefining processes described herein to produce a light hydrocarbonproduct tagged with a carbon isotope. The isotopes allow for theidentification of the tagged products, either alone or in combinationwith other untagged products, such that the tagged products can betraced back to their original biomass feedstocks.

TABLE A Examples of Enzymes Involved in the Isoprenoid Pathway SynthaseSource NCBI protein ID Limonene M. spicata 2ONH_A Cineole S. officinalisAAC26016 Pinene A. grandis AAK83564 Camphene A. grandis AAB70707Sabinene S. officinalis AAC26018 Myrcene A. grandis AAB71084 AbietadieneA. grandis Q38710 Taxadiene T. brevifolia AAK83566 FPP G. gallus P08836Amorphadiene A. annua AAF61439 Bisabolene A. grandis O81086Diapophytoene S. aureus Diapophytoene desaturase S. aureus GPPS-LSU M.spicata AAF08793 GPPS-SSU M. spicata AAF08792 GPPS A. thaliana CAC16849GPPS C. reinhardtii EDP05515 FPP E. coli NP_414955 FPP A. thalianaNP_199588 FPP A. thaliana NP_193452 FPP C. reinhardtii EDP03194 IPPisomerase E. coli NP_417365 IPP isomerase H. pluvialis ABB80114 LimoneneL. angustifolia ABB73044 Monoterpene S. lycopersicum AAX69064Terpinolene O. basilicum AAV63792 Myrcene O. basilicum AAV63791Zingiberene O. basilicum AAV63788 Myrcene Q. ilex CAC41012 Myrcene P.abies AAS47696 Myrcene, ocimene A. thaliana NP_179998 Myrcene, ocimeneA. thaliana NP_567511 Sesquiterpene Z. mays; B73 AAS88571 SesquiterpeneA. thaliana NP_199276 Sesquiterpene A. thaliana NP_193064 SesquiterpeneA. thaliana NP_193066 Curcumene P. cablin AAS86319 Farnesene M.domestica AAX19772 Farnesene C. sativus AAU05951 Farnesene C. junosAAK54279 Farnesene P. abies AAS47697 Bisabolene P. abies AAS47689Sesquiterpene A. thaliana NP_197784 Sesquiterpene A. thailana NP_175313GPP Chimera GPPS-LSU + SSU fusion Geranylgeranyl reductase A. thailanaNP_177587 Geranylgeranyl reductase C. reinhardtii EDP09986Chlorophyllidohydrolase C. reinhardtii EDP01364 ChlorophyllidohydrolaseA. thaliana NP_564094 Chlorophyllidohydrolase A. thaliana NP_199199Phosphatase S. cerevisiae AAB64930 FPP A118W G. gallus

Codon Optimization

As discussed above, one or more codons of an encoding polynucleotide canbe “biased” or “optimized” to reflect the codon usage of the hostorganism. For example, one or more codons of an encoding polynucleotidecan be “biased” or “optimized” to reflect chloroplast codon usage (TableB) or nuclear codon usage (Table C). Most amino acids are encoded by twoor more different (degenerate) codons, and it is well recognized thatvarious organisms utilize certain codons in preference to others.“Biased” or codon “optimized” can be used interchangeably throughout thespecification. Codon bias can be variously skewed in different plants,including, for example, in alga as compared to tobacco. Generally, thecodon bias selected reflects codon usage of the plant (or organelletherein) which is being transformed with the nucleic acids of thepresent disclosure.

A polynucleotide that is biased for a particular codon usage can besynthesized de novo, or can be genetically modified using routinerecombinant DNA techniques, for example, by a site directed mutagenesismethod, to change one or more codons such that they are biased forchloroplast codon usage.

Such preferential codon usage, which is utilized in chloroplasts, isreferred to herein as “chloroplast codon usage.” Table B (below) showsthe chloroplast codon usage for C. reinhardtii (see U.S. PatentApplication Publication No.: 2004/0014174, published Jan. 22, 2004).

TABLE B Chloroplast Codon Usage in Chlamydomonas reinhardtii UUU 34.1*(348**) UCU 19.4 (198) UAU 23.7 (242) UGU 8.5 (87) UUC 14.2 (145) UCC4.9 (50) UAC 10.4 (106) UGC 2.6 (27) UUA 72.8 (742) UCA 20.4 (208) UAA2.7 (28) UGA 0.1 (1) UUG 5.6 (57) UCG 5.2 (53) UAG 0.7 (7) UGG 13.7(140) CUU 14.8 (151) CCU 14.9 (152) CAU 11.1 (113) CGU 25.5 (260) CUC1.0 (10) CCC 5.4 (55) CAC 8.4 (86) CGC 5.1 (52) CUA 6.8 (69) CCA 19.3(197) CAA 34.8 (355) CGA 3.8 (39) CUG 7.2 (73) CCG 3.0 (31) CAG 5.4 (55)CGG 0.5 (5) AUU 44.6 (455) ACU 23.3 (237) AAU 44.0 (449) AGU 16.9 (172)AUC 9.7 (99) ACC 7.8 (80) AAC 19.7 (201) AGC 6.7 (68) AUA 8.2 (84) ACA29.3 (299) AAA 61.5 (627) AGA 5.0 (51) AUG 23.3 (238) ACG 4.2 (43) AAG11.0 (112) AGG 1.5 (15) GUU 27.5 (280) GCU 30.6 (312) GAU 23.8 (243) GGU40.0 (408) GUC 4.6 (47) GCC 11.1 (113) GAC 11.6 (118) GGC 8.7 (89) GUA26.4 (269) GCA 19.9 (203) GAA 40.3 (411) GGA 9.6 (98) GUG 7.1 (72) GCG4.3 (44) GAG 6.9 (70) GGG 4.3 (44) *Frequency of codon usage per 1,000codons. **Number of times observed in 36 chloroplast coding sequences(10,193 codons).

The chloroplast codon bias can, but need not, be selected based on aparticular organism in which a synthetic polynucleotide is to beexpressed. The manipulation can be a change to a codon, for example, bya method such as site directed mutagenesis, by a method such as PCRusing a primer that is mismatched for the nucleotide(s) to be changedsuch that the amplification product is biased to reflect chloroplastcodon usage, or can be the de novo synthesis of polynucleotide sequencesuch that the change (bias) is introduced as a consequence of thesynthesis procedure.

In addition to utilizing chloroplast codon bias as a means to provideefficient translation of a polypeptide, it will be recognized that analternative means for obtaining efficient translation of a polypeptidein a chloroplast is to re-engineer the chloroplast genome (e.g., a C.reinhardtii chloroplast genome) for the expression of tRNAs nototherwise expressed in the chloroplast genome. Such an engineered algaeexpressing one or more exogenous tRNA molecules provides the advantagethat it would obviate a requirement to modify every polynucleotide ofinterest that is to be introduced into and expressed from a chloroplastgenome; instead, algae such as C. reinhardtii that comprise agenetically modified chloroplast genome can be provided and utilized forefficient translation of a polypeptide according to any method of thedisclosure. Correlations between tRNA abundance and codon usage inhighly expressed genes is well known (for example, as described inFranklin et al., Plant J. 30:733-744, 2002; Dong et al., J. Mol. Biol.260:649-663, 1996; Duret, Trends Genet. 16:287-289, 2000; Goldman et.al., J. Mol. Biol. 245:467-473, 1995; and Komar et. al., Biol. Chem.379:1295-1300, 1998). In E. coli, for example, re-engineering of strainsto express underutilized tRNAs resulted in enhanced expression of geneswhich utilize these codons (see Novy et al., in Novations 12:1-3, 2001).Utilizing endogenous tRNA genes, site directed mutagenesis can be usedto make a synthetic tRNA gene, which can be introduced into chloroplaststo complement rare or unused tRNA genes in a chloroplast genome, such asa C. reinhardtii chloroplast genome.

Generally, the chloroplast codon bias selected for purposes of thepresent disclosure, including, for example, in preparing a syntheticpolynucleotide as disclosed herein reflects chloroplast codon usage of aplant chloroplast, and includes a codon bias that, with respect to thethird position of a codon, is skewed towards A/T, for example, where thethird position has greater than about 66% AT bias, or greater than about70% AT bias. In one embodiment, the chloroplast codon usage is biased toreflect alga chloroplast codon usage, for example, C. reinhardtii, whichhas about 74.6% AT bias in the third codon position. Preferred codonusage in the chloroplasts of algae has been described in US2004/0014174.

Table C exemplifies codons that are preferentially used in algal nucleargenes. The nuclear codon bias can, but need not, be selected based on aparticular organism in which a synthetic polynucleotide is to beexpressed. The manipulation can be a change to a codon, for example, bya method such as site directed mutagenesis, by a method such as PCRusing a primer that is mismatched for the nucleotide(s) to be changedsuch that the amplification product is biased to reflect nuclear codonusage, or can be the de novo synthesis of polynucleotide sequence suchthat the change (bias) is introduced as a consequence of the synthesisprocedure.

In addition to utilizing nuclear codon bias as a means to provideefficient translation of a polypeptide, it will be recognized that analternative means for obtaining efficient translation of a polypeptidein a nucleus is to re-engineer the nuclear genome (e.g., a C.reinhardtii nuclear genome) for the expression of tRNAs not otherwiseexpressed in the nuclear genome. Such an engineered algae expressing oneor more exogenous tRNA molecules provides the advantage that it wouldobviate a requirement to modify every polynucleotide of interest that isto be introduced into and expressed from a nuclear genome; instead,algae such as C. reinhardtii that comprise a genetically modifiednuclear genome can be provided and utilized for efficient translation ofa polypeptide according to any method of the disclosure. Correlationsbetween tRNA abundance and codon usage in highly expressed genes is wellknown (for example, as described in Franklin et al., Plant J.30:733-744, 2002; Dong et al., J. Mol. Biol. 260:649-663, 1996; Duret,Trends Genet. 16:287-289, 2000; Goldman et. Al., J. Mol. Biol.245:467-473, 1995; and Komar et. Al., Biol. Chem. 379:1295-1300, 1998).In E. coli, for example, re-engineering of strains to expressunderutilized tRNAs resulted in enhanced expression of genes whichutilize these codons (see Novy et al., in Novations 12:1-3, 2001).Utilizing endogenous tRNA genes, site directed mutagenesis can be usedto make a synthetic tRNA gene, which can be introduced into the nucleusto complement rare or unused tRNA genes in a nuclear genome, such as aC. reinhardtii nuclear genome.

Generally, the nuclear codon bias selected for purposes of the presentdisclosure, including, for example, in preparing a syntheticpolynucleotide as disclosed herein, can reflect nuclear codon usage ofan algal nucleus and includes a codon bias that results in the codingsequence containing greater than 60% G/C content.

TABLE C fields: [triplet] [frequency: per thousand] ([number]) Coding GC66.30% 1^(st) letter GC 64.80% 2^(nd) letter GC 47.90% 3^(rd) letter GC86.21% Nuclear Codon Usage in Chlamydomonas reinhardtii UUU 5.0 (2110)UCU 4.7 (1992) UAU 2.6 (1085) UGU 1.4 (601) UUC 27.1 (11411) UCC 16.1(6782) UAC 22.8 (9579) UGC 13.1 (5498) UUA 0.6 (247) UCA 3.2 (1348) UAA1.0 (441) UGA 0.5 (227) UUG 4.0 (1673) UCG 16.1 (6763) UAG 0.4 (183) UGG13.2 (5559) CUU 4.4 (1869) CCU 8.1 (3416) CAU 2.2 (919) CGU 4.9 (2071)CUC 13.0 (5480) CCC 29.5 (12409) CAC 17.2 (7252) CGC 34.9 (14676) CUA2.6 (1086) CCA 5.1 (2124) CAA 4.2 (1780) CGA 2.0 (841) CUG 65.2 (27420)CCG 20.7 (8684) CAG 36.3 (15283) CGG 11.2 (4711) AUU 8.0 (3360) ACU 5.2(2171) AAU 2.8 (1157) AGU 2.6 (1089) AUC 26.6 (11200) ACC 27.7 (11663)AAC 28.5 (11977) AGC 22.8 (9590) AUA 1.1 (443) ACA 4.1 (1713) AAA 2.4(1028) AGA 0.7 (287) 0AUG 25.7 (10796) ACG 15.9 (6684) AAG 43.3 (18212)AGG 2.7 (1150) GUU 5.1 (2158) GCU 16.7 (7030) GAU 6.7 (2805) GGU 9.5(3984) GUC 15.4 (6496) GCC 54.6 (22960) GAC 41.7 (17519) GGC 62.0(26064) GUA 2.0 (857) GCA 10.6 (4467) GAA 2.8 (1172) GGA 5.0 (2084) GUG46.5 (19558) GCG 44.4 (18688) GAG 53.5 (22486) GGG 9.7 (4087)

Table D lists the codon selected at each position for backtranslatingthe protein to a DNA sequence for synthesis. The selected codon is thesequence recognized by the tRNA encoded in the chloroplast genome whenpresent; the stop codon (TAA) is the codon most frequently present inthe chloroplast encoded genes. If an undesired restriction site iscreated, the next best choice according to the regular Chlamydomonaschloroplast usage table that eliminates the restriction site isselected.

TABLE D Amino acid Codon utilized F TTC L TTA I ATC V GTA S TCA P CCA TACA A GCA Y TAC H CAC Q CAA N AAC K AAA D GAC E GAA C TGC R CGT G GGC WTGG M ATG STOP TAA

Percent Sequence Identity

One example of an algorithm that is suitable for determining percentsequence identity or sequence similarity between nucleic acid orpolypeptide sequences is the BLAST algorithm, which is described, e.g.,in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software forperforming BLAST analysis is publicly available through the NationalCenter for Biotechnology Information. The BLAST algorithm parameters W,T, and X determine the sensitivity and speed of the alignment. TheBLASTN program (for nucleotide sequences) uses as defaults a word length(W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=4, and acomparison of both strands. For amino acid sequences, the BLASTP programuses as defaults a word length (W) of 3, an expectation (E) of 10, andthe BLOSUM62 scoring matrix (as described, for example, in Henikoff &Henikoff (1989) Proc. Natl. Acad. Sci. USA, 89:10915). In addition tocalculating percent sequence identity, the BLAST algorithm also canperform a statistical analysis of the similarity between two sequences(for example, as described in Karlin & Altschul, Proc. Nat'l. Acad. Sci.USA, 90:5873-5787 (1993)). One measure of similarity provided by theBLAST algorithm is the smallest sum probability (P(N)), which providesan indication of the probability by which a match between two nucleotideor amino acid sequences would occur by chance. For example, a nucleicacid is considered similar to a reference sequence if the smallest sumprobability in a comparison of the test nucleic acid to the referencenucleic acid is less than about 0.1, less than about 0.01, or less thanabout 0.001.

Fatty Acids and Glycerol Lipids

The present disclosure describes host cells capable of makingpolypeptides that contribute to the accumulation and/or secretion offatty acids, glycerol lipids, or oils, by transforming host cells (e.g.,alga cells such as C. reinhardtii, D. salina, H. pluvalis, andcyanobacterial cells) with nucleic acids encoding one or more differentenzymes. Examples of such enzymes include acetyl-CoA carboxylase,ketoreductase, thioesterase, malonyltransferase, dehydratase, acyl-CoAligase, ketoacylsynthase, enoylreductase, and desaturase. The enzymescan be, for example, catabolic or biodegrading enzymes.

In some instances, the host cell will naturally produce the fatty acid,glycerol lipid, triglyceride, or oil of interest. Therefore,transformation of the host cell with a polynucleotide encoding anenzyme, for example an ACCase, will allow for the increased activity ofthe enzyme and/or increased accumulation and/or secretion of a moleculeof interest (e.g., a lipid) in the cell.

A change in the accumulation and/or secretion of a desired product, forexample, fatty acids, glycerol lipids, or oils, by a transformed hostcell can include, for example, a change in the total oil content overthat normally present in the cell, or a change in the type of oil thatis normally present in the cell.

Some host cells may be transformed with multiple genes encoding one ormore enzymes. For example, a single transformed cell may containexogenous nucleic acids encoding enzymes that make up an entireglycerolipid synthesis pathway. One example of a pathway might includegenes encoding an acetyl CoA carboxylase, a malonyltransferase, aketoacylsynthase, and a thioesterase. Cells transformed with an entirepathway and/or enzymes extracted from those cells, can synthesize, forexample, complete fatty acids or intermediates of the fatty acidsynthesis pathway. Constructs may contain, for example, multiple copiesof the same gene, multiple genes encoding the same enzyme from differentorganisms, and/or multiple genes with one or more mutations in thecoding sequence(s).

The enzyme(s) produced by the modified cells may result in theproduction of fatty acids, glycerol lipids, triglycerides, or oils thatmay be collected from the cells and, or the surrounding environment(e.g., bioreactor or growth medium). In some embodiments, the collectionof the fatty acids, glycerol lipids, triglycerides, or oils is performedafter the product is secreted from the cell via a cell membranetransporter.

Examples of candidate Chlamydomonas genes encoding enzymes ofglycerolipid metabolism that can be used in the described embodimentsare described in The Chlamydomonas Sourcebook Second Edition, Organellarand Metabolic Processes, Vol. 2, pp. 41-68, David B. Stern (Ed.),(2009), Elsevier Academic Press.

For example, enzymes involved in plastid, mitochondrial, and cytosolicpathways, along with plastidic and cytosolic isoforms of fatty aciddesaturases, and triglyceride synthesis enzymes are described (and theiraccession numbers provided). An exemplary chart of some of the genesdescribed is provided below:

Acyl-ACP thioesterase FAT1 EDP08596 Long-chain acyl-CoA synthetase LCS1EDO96800 CDP-DAG: Inositol phosphotransferase PIS1 EDP06395 Acyl-CoA:Diacylglycerol acyltransferase DGA1 EDO96893 Phospholipid:Diacylglycerol LRO1(LCA1) EDP07444 acyltransferase

Examples of the types of fatty acids and/or glycerol lipids that a hostcell or organism can produce, are described below.

Lipids are a broad group of naturally occurring molecules which includesfats, waxes, sterols, fat-soluble vitamins (such as vitamins A, D, E andK), monoglycerides, diglycerides, phospholipids, and others. The mainbiological functions of lipids include energy storage, as structuralcomponents of cell membranes, and as important signaling molecules.

Lipids may be broadly defined as hydrophobic or amphiphilic smallmolecules; the amphiphilic nature of some lipids allows them to formstructures such as vesicles, liposomes, or membranes in an aqueousenvironment. Biological lipids originate entirely or in part from twodistinct types of biochemical subunits or “building blocks”: ketoacyland isoprene groups. Lipids may be divided into eight categories: fattyacyls, glycerolipids, glycerophospholipids, sphingolipids,saccharolipids and polyketides (derived from condensation of ketoacylsubunits); and sterol lipids and prenol lipids (derived fromcondensation of isoprene subunits). For this disclosure, saccharolipidswill not be discussed.

Fats are a subgroup of lipids called triglycerides. Lipids alsoencompass molecules such as fatty acids and their derivatives (includingtri-, di-, and monoglycerides and phospholipids), as well as othersterol-containing metabolites such as cholesterol. Humans and othermammals use various biosynthetic pathways to both break down andsynthesize lipids.

Fatty Acyls

Fatty acyls, a generic term for describing fatty acids, their conjugatesand derivatives, are a diverse group of molecules synthesized bychain-elongation of an acetyl-CoA primer with malonyl-CoA ormethylmalonyl-CoA groups in a process called fatty acid synthesis. Afatty acid is any of the aliphatic monocarboxylic acids that can beliberated by hydrolysis from naturally occurring fats and oils. They aremade of a hydrocarbon chain that terminates with a carboxylic acidgroup; this arrangement confers the molecule with a polar, hydrophilicend, and a nonpolar, hydrophobic end that is insoluble in water. Thefatty acid structure is one of the most fundamental categories ofbiological lipids, and is commonly used as a building block of morestructurally complex lipids. The carbon chain, typically between four to24 carbons long, may be saturated or unsaturated, and may be attached tofunctional groups containing oxygen, halogens, nitrogen and sulfur;branched fatty acids and hydroxyl fatty acids also occur, and very longchain acids of over 30 carbons are found in waxes. Where a double bondexists, there is the possibility of either a cis or trans geometricisomerism, which significantly affects the molecule's molecularconfiguration. Cis-double bonds cause the fatty acid chain to bend, aneffect that is more pronounced the more double bonds there are in achain. This in turn plays an important role in the structure andfunction of cell membranes. Most naturally occurring fatty acids are ofthe cis configuration, although the trans form does exist in somenatural and partially hydrogenated fats and oils.

Examples of biologically important fatty acids are the eicosanoids,derived primarily from arachidonic acid and eicosapentaenoic acid, whichinclude prostaglandins, leukotrienes, and thromboxanes. Other majorlipid classes in the fatty acid category are the fatty esters and fattyamides. Fatty esters include important biochemical intermediates such aswax esters, fatty acid thioester coenzyme A derivatives, fatty acidthioester ACP derivatives and fatty acid carnitines. The fatty amidesinclude N-acyl ethanolamines.

Glycerolipids

Glycerolipids are composed mainly of mono-, di- and tri-substitutedglycerols, the most well-known being the fatty acid esters of glycerol(triacylglycerols), also known as triglycerides. In these compounds, thethree hydroxyl groups of glycerol are each esterified, usually bydifferent fatty acids. Because they function as a food store, theselipids comprise the bulk of storage fat in animal tissues. Thehydrolysis of the ester bonds of triacylglycerols and the release ofglycerol and fatty acids from adipose tissue is called fat mobilization.

Additional subclasses of glycerolipids are represented byglycosylglycerols, which are characterized by the presence of one ormore sugar residues attached to glycerol via a glycosidic linkage. Anexample of a structure in this category is thedigalactosyldiacylglycerols found in plant membranes.

Exemplary Chlamydomonas glycerolipids include: DGDG,digalactosyldiacylglycerol; DGTS,diacylglyceryl-N,N,N-trimethylhomoserine; MGDG,monogalactosyldiacylglycerol; PtdEtn, phosphatidylethanolamine; PtdGro,phosphatidylglycerol; PtdIns, phosphatidylinositol; SQDG,sulfoquinovosyldiacylglycerol; and TAG, triacylglycerol.

Glycerophospholipids

Glycerophospholipids are any derivative of glycerophosphoric acid thatcontains at least one O-acyl, O-alkyl, or O-alkenyl group attached tothe glycerol residue. The common glycerophospholipids are named asderivatives of phosphatidic acid (phosphatidyl choline, phosphatidylserine, and phosphatidyl ethanolamine).

Glycerophospholipids, also referred to as phospholipids, are ubiquitousin nature and are key components of the lipid bilayer of cells, as wellas being involved in metabolism and cell signaling. Glycerophospholipidsmay be subdivided into distinct classes, based on the nature of thepolar headgroup at the sn-3 position of the glycerol backbone ineukaryotes and eubacteria, or the sn-1 position in the case ofarchaebacteria.

Examples of glycerophospholipids found in biological membranes arephosphatidylcholine (also known as PC, GPCho or lecithin),phosphatidylethanolamine (PE or GPEtn) and phosphatidylserine (PS orGPSer). In addition to serving as a primary component of cellularmembranes and binding sites for intra- and intercellular proteins, someglycerophospholipids in eukaryotic cells, such as phosphatidylinositolsand phosphatidic acids are either precursors of, or are themselves,membrane-derived second messengers. Typically, one or both of thesehydroxyl groups are acylated with long-chain fatty acids, but there arealso alkyl-linked and 1Z-alkenyl-linked (plasmalogen)glycerophospholipids, as well as dialkylether variants inarchaebacteria.

Sphingolipids

Sphingolipids are any of class of lipids containing the long-chain aminodiol, sphingosine, or a closely related base (i.e. a sphingoid). A fattyacid is bound in an amide linkage to the amino group and the terminalhydroxyl may be linked to a number of residues such as a phosphate esteror a carbohydrate. The predominant base in animals is sphingosine whilein plants it is phytosphingosine.

The main classes are: (1) phosphosphigolipids (also known assphingophospholipids), of which the main representative issphingomyelin; and (2) glycosphingolipids, which contain at least onemonosaccharide and a sphingoid, and include the cerebrosides andgangliosides. Sphingolipids play an important structural role in cellmembranes and may be involved in the regulation of protein kinase C.

As mentioned above, sphingolipids are a complex family of compounds thatshare a common structural feature, a sphingoid base backbone, and aresynthesized de novo from the amino acid serine and a long-chain fattyacyl CoA, that are then converted into ceramides, phosphosphingolipids,glycosphingolipids and other compounds. The major sphingoid base ofmammals is commonly referred to as sphingosine. Ceramides(N-acyl-sphingoid bases) are a major subclass of sphingoid basederivatives with an amide-linked fatty acid. The fatty acids aretypically saturated or mono-unsaturated with chain lengths from 16 to 26carbon atoms.

The major phosphosphingolipids of mammals are sphingomyelins (ceramidephosphocholines), whereas insects contain mainly ceramidephosphoethanolamines, and fungi have phytoceramide phosphoinositols andmannose-containing headgroups. The glycosphingolipids are a diversefamily of molecules composed of one or more sugar residues linked via aglycosidic bond to the sphingoid base. Examples of these are the simpleand complex glycosphingolipids such as cerebrosides and gangliosides.

Sterol Lipids

Sterol lipids, such as cholesterol and its derivatives, are an importantcomponent of membrane lipids, along with the glycerophospholipids andsphingomyelins. The steroids, all derived from the same fused four-ringcore structure, have different biological roles as hormones andsignaling molecules. The eighteen-carbon (C18) steroids include theestrogen family whereas the C19 steroids comprise the androgens such astestosterone and androsterone. The C21 subclass includes theprogestogens as well as the glucocorticoids and mineralocorticoids. Thesecosteroids, comprising various forms of vitamin D, are characterizedby cleavage of the B ring of the core structure. Other examples ofsterols are the bile acids and their conjugates, which in mammals areoxidized derivatives of cholesterol and are synthesized in the liver.The plant equivalents are the phytosterols, such as β-sitosterol,stigmasterol, and brassicasterol; the latter compound is also used as abiomarker for algal growth. The predominant sterol in fungal cellmembranes is ergosterol.

Prenol Lipids

Prenol lipids are synthesized from the 5-carbon precursors isopentenyldiphosphate and dimethylallyl diphosphate that are produced mainly viathe mevalonic acid (MVA) pathway. The simple isoprenoids (for example,linear alcohols and diphosphates) are formed by the successive additionof C5 units, and are classified according to the number of these terpeneunits. Structures containing greater than 40 carbons are known aspolyterpenes. Carotenoids are important simple isoprenoids that functionas antioxidants and as precursors of vitamin A. Another biologicallyimportant class of molecules is exemplified by the quinones andhydroquinones, which contain an isoprenoid tail attached to a quinonoidcore of non-isoprenoid origin. Prokaryotes synthesize polyprenols(called bactoprenols) in which the terminal isoprenoid unit attached tooxygen remains unsaturated, whereas in animal polyprenols (dolichols)the terminal isoprenoid is reduced.

Polyketides

Polyketides or sometimes acetogenin are any of a diverse group ofnatural products synthesized via linear poly-β-ketones, which arethemselves formed by repetitive head-to-tail addition of acetyl (orsubstituted acetyl) units indirectly derived from acetate (or asubstituted acetate) by a mechanism similar to that for fatty-acidbiosynthesis but without the intermediate reductive steps. In many case,acetyl-CoA functions as the starter unit and malonyl-CoA as theextending unit. Various molecules other than acetyl-CoA may be used asstarter, often with methoylmalonyl-CoA as the extending unit. Thepoly-β-ketones so formed may undergo a variety of further types ofreactions, which include alkylation, cyclization, glycosylation,oxidation, and reduction. The classes of product formed—and theircorresponding starter substances—comprise inter alia: coniine (ofhemlock) and orsellinate (of lichens)—acetyl-CoA; flavanoids andstilbenes—cinnamoyl-CoA; tetracyclines—amide of malonyl-CoA; urushiols(of poison ivy)—palmitoleoyl-CoA; and erythonolides—propionyl-CoA andmethyl-malonyl-CoA as extender.

Polyketides comprise a large number of secondary metabolites and naturalproducts from animal, plant, bacterial, fungal and marine sources, andhave great structural diversity. Many polyketides are cyclic moleculeswhose backbones are often further modified by glycosylation,methylation, hydroxylation, oxidation, and/or other processes. Manycommonly used anti-microbial, anti-parasitic, and anti-cancer agents arepolyketides or polyketide derivatives, such as erythromycins,tetracyclines, avermectins, and antitumor epothilones.

The following examples are intended to provide illustrations of theapplication of the present disclosure. The following examples are notintended to completely define or otherwise limit the scope of thedisclosure. One of skill in the art will appreciate that many othermethods known in the art may be substituted in lieu of the onesspecifically described or referenced herein.

EXAMPLES Example 1 Generating the Library and Isolation of CandidateStrains

In this example, an insertional mutagenesis library was generated toisolate candidates resistant to high concentrations of Sodium Chloride.All DNA manipulations carried out in this example were essentially asdescribed by Sambrook et al., Molecular Cloning: A Laboratory Manual(Cold Spring Harbor Laboratory Press 1989) and Cohen et al., Meth.Enzymol. 297, 192-208, 1998.

Transforming DNA, the SENuc146 plasmid shown in FIG. 8, was created byusing pBluescript II SK(−) (Agilent Technologies, CA) as a vectorbackbone. The segment labeled Aph 7″ is the hygronmycin resistance genefrom Streptomyces hygroscopicus. The first intron from the Chlamydomonasreinhardtii rbcS2 gene is cloned into Aph 7″ in order to increaseexpression levels and consequentially, the number of transformants(Berthold et al. Protist 153:401-412 (2002)). Aph 7″ is preceded by theChlamydomonas reinhardtii β2-tubulin promoter and is followed by theChlamydomonas reinhardtii rbcS2 terminator. Subsequently, the segmentlabeled “Hybrid Promoter” indicates a fused promoter region beginningwith the C. reinhardtii Hsp70A promoter, C. reinhardtii rbcS2 promoter,and the first intron from the C. reinhardtii rbcS2 gene (Sizova et al.Gene, 277:221-229 (2001)). The SENuc140 plasmid (FIG. 9) was created bysubstituting Aph 7″ cassette with the gene encoding theaminoglycoside-O-phosphotransferase VIII (Aph VIII) from Streptomycesrimosus flanked by the promoter and terminator of the C. reinhardtiipsaD gene. Expression of Aph VIII confers resistance to the antibioticparomomycin and has been shown to yield large numbers of transformants(Sizova et al. Gene, 181:13-18 (1996)).

Transformation DNA was prepared by digesting either SENuc 146 or SENuc140 with the restriction enzymes NotI and NdeI followed by DNA gelpurification to separate the selectable marker cassette from thebackbone vector. For these experiments, all transformations were carriedout on C. reinhardtii cc1690 (mt+). Cells were grown and transformed viaelectroporation. Cells were grown to mid-log phase (approximately2-6×10⁶ cells/ml) in TAP media. Cells were spun down at between 2000×gand 5000×g for 5 min. The supernatant was removed and the cells wereresuspended in TAP media+40 mM sucrose. 250 ng (in 1-5 μL H₂O) oftransformation DNA was mixed with 250 μL of 3×10⁸ cells/mL on ice andtransferred to 0.4 cm electroporation cuvettes. In order to generate asufficient number of transformants, at least 50 transformation reactionswere set up. Electroporation was performed with the capacitance set at25 uF, the voltage at 800 V to deliver 2000 V/cm resulting in a timeconstant of approximately 10-14 ms. Following electroporation, thecuvette was returned to room temperature for 5-20 min. For eachtransformation, cells were transferred to 10 ml of TAP media+40 mMsucrose and allowed to recover at room temperature for 12-16 hours withcontinuous shaking. Cells were then harvested by centrifugation atbetween 2000×g and 5000×g, the supernatant was discarded, and the pelletwas resuspended in 0.5 ml TAP media+40 mM sucrose. The resuspended cellswere then plated on solid TAP media+20 μg/mL hygromycin or solid TAPmedia+20 μg/mL paromomycin. 50 transformations, using a total of 12.5 μgof purified transformation DNA, would typically yield approximately200,000 individual transformants.

Transformants were then scraped into 1 L liquid TAP media and allowed torecover at room temperature for 48 hours with continuous agitation.After one to two days of the library recovering in TAP media, a celldensity count was taken. In order to ensure full coverage of thelibrary, 10× of the library size was needed. For example, if the librarysize was 2×10⁵ transformants, then 2×10⁶ cells were carried on forselection.

As indicated above, 10× of the library size was spun down in triplicateat 3000×g for 5 minutes. The pellets were washed 3 times with 50 mL ofG₀ media. After the washes, the pellets were resuspended in 10 mL ofliquid G₀ media and plated on Bioassay Trays (Nunc catalog number240835) containing solid G₀ media+75 mM NaCl, G₀+100 mM NaCl, and G₀+125mM NaCl. G₀ media is composed of 0.07 mM FeCl₃, 11.71 mM Na₂EDTA, 0.0002mM CoCl₂, 0.0003 mM ZnSO₄, 0.0001 mM CuSO₄, 0.0035 mM MnCl₂, 0.0001 mMNa₂MoO₄, 1.42 mM NaNO₃, 0.21 mM NaH₂PO₄, 0.003 mM ThiamineHydrochloride, 0.0000019 mM Vitamin B₁₂, 0.0000106 mM Biotin, 0.406 mMMgSO₄□7H₂0, 0.0476 mM CaCl₂□2H₂0, 0.162 mM H₃BO₃, 0.00710 mM NaVO₃, 5.95mM NaHCO₃. In addition, parallel liquid assays were performed using thesame range of NaCl concentrations (data not shown). Plates were thenplaced at room temperature in high light in a box fed with 5% CO₂.Colonies tolerant to increased NaCl appeared about 10 to 14 days later.Colonies were struck out on solid G₀ media for single colonies to ensureclonality. Single colonies were then picked into liquid G₀ media forsecondary screening.

Candidates were plated onto solid G₀ media+75 mM NaCl, G₀+100 mM NaCl,and G₀+125 mM NaCl. Candidates were also inoculated 1:100 (v:v) intoliquid G₀ media+75 mM NaCl, G₀+100 mM NaCl, and G₀+125 mM NaCl. Thisprocess was utilized both to confirm the phenotype, but also toqualitatively rank order the candidates by level of resistance.Confirmed candidates were carried forward for identification andvalidation (Table 1).

Example 2 Segregation Analysis of Candidate Strains

Segregation analysis was another method to validate that the randominsertion of the exogenous DNA containing a selectable marker conferringantibiotic resistance is genetically linked to the observed phenotype.The mating type + and mating type − of Chlamydomonas reinhardtii can becrossed. The S7 candidate strain (mating type +) was crossed with C.reinhardtii cc1691 (mating type −) by growing both separately on solidTAP media for 5-7 days at room temperature and high light. Cells wereresuspended in nitrogen-free liquid TAP media for 2 hours under light.200 μl of both S7 and cc1691 were mixed and left for at least 2 hours tomate. Cells were plated on solid HSM media and grown overnight underlight and subsequently stored in the dark for 3 days. Chloroform vaportreatment was applied for 30 seconds to eliminate gametes. The plate wasplaced under light for approximately one week to allow the zygote togerminate. Clonal colonies were obtained by serial dilution.

5 colonies that were resistant to hygromycin and 5 colonies that weresensitive to hygromycin were inoculated into liquid G₀ media and liquidG₀ media+75 mM NaCl. The results shown in FIG. 41 demonstrate that thephenotype segregates with the antibiotic resistance. This validates thatthe phenotype is physically linked with the gene disruption.

Example 3 Identification of Candidate Strains and miRNA KnockdownAnalysis

In this example, the identity of the gene disruption of all candidatestrains that were resistant to 75 mM-125 mM NaCl was determined.Subsequently, artificial miRNAs were designed to knockdown theidentified gene to reproduce the phenotype as a means of validation. AllDNA manipulations carried out in this example were essentially asdescribed by Sambrook et al., Molecular Cloning: A Laboratory Manual(Cold Spring Harbor Laboratory Press 1989) and Cohen et al., Meth.Enzymol. 297, 192-208, 1998.

Identification

Candidate strains confirmed with the desired phenotype were grown onsolid TAP media+20 μg/mL hygromycin or solid TAP media+20 μg/mLparomomycin depending on the transformation DNA used. Approximately 5 mLof a saturated culture was processed to isolate genomic DNA. Genomic DNAwas isolated from individual mutants (colonies), using the PromegaWizard Genomic DNA Purification Kit (Promega Cat. #A1125). The procedurefor “Isolation of Genomic DNA from Plant Tissue” outlined in thetechnical manual for the kit was followed. Results from identificationare summarized in Table 1. Genome walking encompasses many methods, eachresulting in limited success, that have been used to identify the DNAsequence flanking a region of known identity. Three main methods wereutilized to maximize the success rate of identification. The methods aredescribed.

Adaptor Ligation Method or Cassette PCR Adaptor

500 ng-1 μg genomic DNA of a candidate strain was digested with bluntend restriction enzymes (PmII and PvuII) as recommended by themanufacturer (NEB). Digested genomic DNA was purified with PromegaWizard DNA Clean-up system (Promega Cat. #A7280). In order to generatethe adaptor, both adaptor primers (see Table 3 for SEQ ID NO: 88 and SEQID NO: 89) were resuspended in STE buffer (10 mM Tris-HCl pH 8.0, 50 mMNaCl, 1 mM EDTA). 25 μL of each adapter pair was mixed into one reactionand annealed from 96° C. to 4° C. by decreasing 0.5° C. per second. 4 μlof digested and purified genomic DNA from each candidate was ligated tothe 2 μl of 25 μM adaptor using T4 DNA ligase as recommended by themanufacturer (NEB). Primary PCR with adaptor ligated genomic DNA wasperformed under the following conditions: 1 μl ligated DNA, 1×Ex TaqBuffer (Takara Bio inc), 0.5M Betaine, 3% DMSO (dimethyl sulfoxide), 0.1mM dNTPs, 1 μM Adaptor Primer 1 (SEQ ID NO: 92, see Table 3), 1 μMcassette-specific primer (SEQ ID NOS: 96, 97, 101, 102, 106, 107, 110,and 111, see Table 3 for an appropriate cassette-specific primer) and 1unit Ex Taq (Takara Bio inc) in a 20 μl reaction volume. There wereseveral options for the cassette-specific primer: hygromycin orparomomycin-specific, 5′ and/or 3′, and two or three primers within eachspecification. Primary PCR parameters were as follows: 1 cycle [95° C.for 2 min], 35 cycles [94° C. for 20 sec, annealing at 55° C. for 20sec, extension at 72° C. for 4 min], and 1 cycle [extension at 72° C.for 2 min].

A secondary nested PCR was then performed with 0.5 μl of the primary PCRreaction, 1 μM Adaptor Primer 2 (SEQ ID NO: 93, see Table 3), 1 μMnested cassette-specific primer (SEQ ID NOS: 98, 99, 100, 103, 104, 105,108, 109, 112, 113, 114, see Table 3 for an appropriate nested-cassettespecific primer), 1×Ex Taq Buffer (Takara Bio inc), 0.5M Betaine, 3%DMSO (dimethyl sulfoxide), 0.1 mM dNTPs, and 1 unit Ex Taq (Takara Bioinc) in a 20 μl reaction volume. There were several options for thenested cassette-specific primer: hygromycin or paromomycin-specific, 5′and/or 3′, and two or three primers within each specification. SecondaryPCR parameters were as follows: 1 cycle [94° C. for 2 min], 42 cycles[95° C. for 20 sec, annealing at 57° C. for 20 sec, extension at 72° C.for 4 min], and 1 cycle [extension at 72° C. for 2 min]. PCR reactionswere observed on a 1% agarose/EtBr electrophoresis gel. Bands wereexcised and purified using Zymoclean Gel DNA Recovery Kit (Zymo researchCat. #D4022). Purified DNA was sequenced using the appropriate AP2primer or the appropriate nested cassette-specific primer. BLASTanalysis was used to identify the location of the insert in theChlamydomonas reinhardtii nuclear genome(http://genome.jgi-psf.org/Chlre4/Chlre4.home.html). BLAST analysis wasused to determine the identity of the disrupted gene.

Inverse Tandem Repeat (ITR) or Suppression PCR

500 ng-1 μg genomic DNA of a candidate strain was digested with bluntend restriction enzymes (PmII and PvuII) as recommended by themanufacturer (NEB). Digested genomic DNA was purified with PromegaWizard DNA Clean-up system (Promega Cat. #A7280. In order to generatethe adaptor, both adaptor primers (see Table 3 for SEQ ID NO: 90 and SEQID NO: 91) were resuspended in STE buffer (10 mM Tris-HC pH 8.0, 50 mMNaCl, 1 mM EDTA). 25 μL of each adapter pair was mixed into one reactionand annealed from 96° C. to 4° C. by decreasing 0.5° C. per second. 4 μlof digested and purified genomic DNA was ligated to 2 μl of 25 μMadaptor using T4 DNA ligase as recommended by the manufacturer (NEB).

Primary PCR with adaptor ligated genomic DNA was performed under thefollowing conditions: 1 μl ligated DNA, 1×Ex Taq Buffer (Takara Bioinc), 0.5M Betaine, 3% DMSO (dimethyl sulfoxide), 0.1 mM dNTPs, 1 μMAdaptor Primer 3 (SEQ ID NO: 93, see Table 3), 1 μM cassette-specificprimer (SEQ ID NOS: 96, 97, 101, 102, 106, 107, 110, and 111, see Table3 for an appropriate cassette-specific primer) and 1 unit Ex Taq (TakaraBio inc) in a 20 μl reaction volume. There were several options for thecassette-specific primer: hygromycin or paromomycin-specific, 5′ and/or3′, and two or three primers within each specification. Primary PCRparameters were as follows: 1 cycle [95° C. for 2 min], 35 cycles [94°C. for 20 sec, annealing at 55° C. for 20 sec, extension at 72° C. for 4min], and 1 cycle [extension at 72° C. for 2 min].

A secondary nested PCR was then performed with 0.5 μl of the primary PCRreaction, 1 μM Adaptor Primer 4 (SEQ ID NO: 94, see Table 3), 1 μMcassette-specific primer (SEQ ID NOS: 98, 99, 100, 103, 104, 105, 108,109, 112, 113, 114, see Table 3 for an appropriate nestedcassette-specific primer), 1×Ex Taq Buffer (Takara Bio inc), 0.5MBetaine, 3% DMSO (dimethyl sulfoxide), 0.1 mM dNTPs, and 1 unit Ex Taq(Takara Bio inc) in a 20 μl reaction volume. There were several optionsfor the nested cassette-specific primer: hygromycin orparomomycin-specific, 5′ and/or 3′, and two or three primers within eachspecification. Secondary PCR parameters were as follows: 1 cycle [95° C.for 2 min], 42 cycles [95° C. for 20 see, annealing at 57° C. for 20sec, extension at 720° C. for 4 min], and 1 cycle [extension at 72° C.for 2 min]. PCR reactions were observed on a 1% agarose/EtBrelectrophoresis gel. Bands were excised and purified using Zymoclean GelDNA Recovery Kit (Zymo research Cat. #D4022). Purified DNA was sequencedusing the appropriate AP4 primer or the nested cassette-specific primer.BLAST analysis was used to identify the location of the insert in theChlamydomonas reinhardtii nuclear genome(http://genome.jgi-psf.org/Chlre4/Chlre4.home.html). BLAST analysis wasused to determine the identity of the disrupted gene.

Restriction-Site PCR

Restriction site PCR takes advantage of endogenous restriction siteswithin the genome that helps serve as priming sites for PCRamplification (Sarkar, G., et al. (1993) Genome Res. 2: 318-322).Primary PCR with candidate strain genomic DNA was performed under thefollowing conditions: 1 μl of 100 ng/μl DNA, 1×Ex Taq Buffer (Takara Bioinc), 0.5M Betaine, 3% DMSO (dimethyl sulfoxide), 0.1 mM dNTPs, 1 μM RSOprimer (SEQ ID NO: 241 or SEQ ID NO: 242 in Table 3 can be used), 1 μMcassette-specific primer (SEQ ID NOS: 96, 97, 101, 102, 106, 107, 110,and 111, see Table 3 for an appropriate cassette-specific primer), and1U Ex Taq (Takara Bio inc) in a 20 μl reaction volume. There wereseveral options for the cassette-specific primer: hygromycin orparomomycin-specific, 5′ and/or 3′, and two or three primers within eachspecification. Primary PCR parameters were as follows: 1 cycle [94° C.for 2 min], 30 cycles [94° C. for 1 min, annealing at 55° C. for 1 min,extension at 72° C. for 3 min], and 1 cycle [extension at 72° C. for 10min].

Secondary nested PCR was performed with 0.5 μl of the primary PCRreaction, 1×Ex Taq Buffer (Takara Bio, Inc.), 0.5M Betaine, 3% DMSO(dimethyl sulfoxide), 0.1 mM dNTPs, 1 μM of the same RSO primer used inthe primary PCR, 1 μM nested cassette-specific primer (SEQ ID NOS: 98,99, 100, 103, 104, 105, 108, 109, 112, 113, 114, see Table 3 for anappropriate nested cassette-specific primer), and 1U Ex Taq (Takara Bioinc) in a 20 μl reaction volume. There were several options for thecassette-specific primer: hygromycin or paromomycin-specific, 5′ and/or3′, and two or three primers within each specification. Secondary nestedPCR parameters were as follows: 1 cycle [94° C. for 2 min], 30 cycles[94° C. for 1 min, annealing at 55° C. for 1 min, extension at 72‘C’ for3 min], and 1 cycle [extension at 72° C. for 10 min]. PCR reactions wereobserved on a 1% agarose/EtBr electrophoresis gel. Bands were excisedand purified using Zymoclean Gel DNA Recovery Kit (Zymo research Cat,#D4022). Purified DNA was sequenced using the appropriate AP2 primer orthe nested cassette-specific primer. BLAST analysis was used to identifythe location of the insert in the Chlamydomonas reinhardtii nucleargenome (http://genome.jgi-psf.org/Chlre4/Chlre4.home.html). BLASTanalysis was used to determine the identity of the disrupted gene.

Artificial miRNA Mediated Silencing

Sequence characterization of the gene disruption (See Table 1) allowsfor validation by RNA interference. Expression of a transcript may besuppressed by expressing inverted repeat transgenes or artificial miRNAs(Rohr, J., et al., Plant J, 40, 611-621 (2004); Molnar et al., Nature,447:1126-1130 (2007); Molnar et al., Plant J, 58:165-174 (2009)). Anexample of the artificial miRNA system is shown in FIG. 5 and FIG. 6. Astrain transformed with an expression cassette that produces twoproteins, a Zeocin resistance protein and a xylanase (BD12), from asingle transcript, was transformed with an artificial miRNA cassette totarget the xylanase transcript. The variation of efficacy was shown bythe 12 individual strains. Some strains were not knocked down (high inxyalanse activity, high in Zeocin resistance, high in xylanasetranscript level), but some strains were knocked down (low in xlanaseactivity, sensitive to Zeocin, and low in xyalanse transcript). Thesedata verified that applying artificial miRNA constituted a validationmethod. Reproducing the salt resistance by silencing the identified genetarget validates the gene target as the genetic determinant of thephenotype.

The artificial miRNA expression vector was constructed as follows. Themodified expression vector, SENuc391 (FIG. 1), was created by usingpBluescript II SK(−) (Agilent Technologies, CA) as a vector backbone.The segment labeled “Aph 7″” was the hygromycin resistance gene fromStreptomyces hygroscopicus. The first intron from the Chlamydomonasreinhardtii rbcS2 gene was cloned into Aph 7″ in order to increaseexpression levels and consequentially, the number of transformants(Berthold et al. Protist 153:401-412 (2002)). Aph 7″ was preceded by theChlamydomonas reinhardtii β2-tubulin promoter and was followed by theChlamydomonas reinhardtii rbcS2 terminator. The hygromycin resistancecassette was cloned into the NotI and XbaI sites of pBluescript IISK(−). Subsequently, the segment labeled “Hybrid Promoter” indicates afused promoter region beginning with the C. reinhardtii Hsp70A promoter,C. reinhardtii rbcS2 promoter, and the first intron from the C.reinhardtii rbcS2 gene (Sizova et al. Gene, 277:221-229 (2001)). The“Hybrid Promoter” was PCR amplified using overlapping primers whileintroducing restriction sites to both the 5′ (XbaI) and 3′ (NdeI, BamHI,KpnI) ends. This PCR-generated fragment was cloned into the XbaI andKpnI sites of the hygromycin resistance cassette-containing pBluescriptII SK(−). The segment labeled “Aph VIII” was the paromomycin resistancegene flanked by the promoter and terminator of the C. reinhardtii psaDgene. The cassette was blunt end ligated into the digested KpnI sitetreated with Klenow.

The generation of the precursor scaffold was performed similarly aspreviously described (Molnar et al., Plant J, 58:165-174 (2009)). The 5′arm of the precursor scaffold was amplified from C. reinhardtii genomicDNA by two primers Arm Primer 1 (SEQ ID NO: 243) and Arm Primer 2 (SEQID NO: 244). The 3′ arm of the precursor scaffold was amplified by thetwo primers Arm Primer 3 (SEQ ID NO: 245) and Arm Primer 4 (SEQ ID NO:246). The two resulting PCR fragments were gel purified and fusedtogether in a PCR reaction using the primers Arm Primer 1 (SEQ ID NO:243) and Arm Primer 4 (SEQ ID NO: 246) resulting in a 259 bp fusionproduct. The PCR fragment was gel-purified, digested with AseI andBamHI, and ligated into the NdeI and BamHI sites of SEnuc391.

The transcript IDs of the candidate genes (See Table 1) were submittedto the Web MicroRNA Designer (Ossowski et al., Plant J, 53:674-690;WMD3, http://wmd3.weigelworld.org/). For each gene, predicted miRNAswere converted to full stem-loop sequences, including the endogenouscre-MIR1157 spacer, and the corresponding miRNA*, using the WMD3 Oligofunction with “pChlamiRNA2 and 3” selected as the vector. The resultingsequences were modified by adding flanking BglII sites, as well asadding sequence complementary to the 5′ end of the antisense strand ofthe BD11 (SEQ ID NO. 228) sequence to the 3′ end. The modified sequenceswere synthesized and Table 2 shows the artificial miRNA sequences thatare associated with the NaCl candidate strain number and gene sequence.In order to clone the miRNA stem-loop sequences into SENuc391, acomplementary strand was first added by PCR amplification in thepresence of BD11, each ultramer, and a primer (SEQ ID NO. 229) in a2-cycle Phusion PCR reaction following the manufacturer's instructions(Finnzymes). The resulting double-stranded DNA fragments were clonedinto the BglII site of SENuc391. The resulting plasmid was sequenced forthe appropriate orientation.

TABLE 2 Sequence Listing Strain amiRNA Sequence Number Number Number SEQID NO: 115 S7 SEQ ID NO: 207 SEQ ID NO: 116 S16 SEQ ID NO: 208 SEQ IDNO: 122 S65 SEQ ID NO: 209 SEQ ID NO: 201 S1659 SEQ ID NO: 223 SEQ IDNO: 129 S77 SEQ ID NO: 210 SEQ ID NO: 202 S1666 SEQ ID NO: 224 SEQ IDNO: 206 S1704 SEQ ID NO: 227 SEQ ID NO: 133 S105 SEQ ID NO: 211 SEQ IDNO: 198 S1612 SEQ ID NO: 221 SEQ ID NO: 200 S1644 SEQ ID NO: 222 SEQ IDNO: 204 S1693 SEQ ID NO: 226 SEQ ID NO: 203 S1687 SEQ ID NO: 225 SEQ IDNO: 135 S129 SEQ ID NO: 213 SEQ ID NO: 134 S123 SEQ ID NO: 212 SEQ IDNO: 166 S289 SEQ ID NO: 215 SEQ ID NO: 162 S276 SEQ ID NO: 214 SEQ IDNO: 169 S292 SEQ ID NO: 217 SEQ ID NO: 168 S291 SEQ ID NO: 216 SEQ IDNO: 170 S294 SEQ ID NO: 218 SEQ ID NO: 175 S338 SEQ ID NO: 220 SEQ IDNO: 127 S74 SEQ ID NO: 235 SEQ ID NO: 230 S1613 SEQ ID NO: 236 SEQ IDNO: 231 S1621 SEQ ID NO: 237 SEQ ID NO: 232 S1623 SEQ ID NO: 238 SEQ IDNO: 233 S1638 SEQ ID NO: 239 SEQ ID NO: 234 S1655 SEQ ID NO: 240

Preparation of the transformation DNA involves a restriction digest withthe enzymes PsiI to linearize the DNA. All transformations were carriedout on C. reinhardtii cc1690 (mt+). Cells were grown and transformed viaelectroporation. Cells were grown to mid-log phase (approximately2-6×10⁶ cells/ml) in TAP media. Cells were spun down gently (between2000 and 5000×g) for 5 min. The supernatant was removed and the cellswere resuspended in TAP media+40 mM sucrose. 1 μg (in 1-5 μL H₂O) oftransformation DNA was mixed with 250 μL of 3×10⁸ cells/mL on ice andtransferred to 0.4 cm electroporation cuvettes. Electroporation wasperformed with the capacitance set at 25 uF, the voltage at 800 V todeliver 2000 V/cm resulting in a time constant of approximately 10-14ms. Following electroporation, the cuvette was returned to roomtemperature for 5-20 min. Cells were transferred to 10 ml of TAPmedia+40 mM sucrose and allowed to recover at room temperature for 12-16hours with continuous shaking. Cells were then harvested bycentrifugation for 5 min at between 2000×g and 5000×g, the supernatantwas discarded, and the pellet was resuspended in 0.5 ml TAP media+40 mMsucrose. The resuspended cells were then plated on solid TAP media+10μg/mL hygromycin and +10 μg/mL paromomycin.

Selection

42 Colonies transformed with artificial miRNA constructs were pickedinto a 96-well microtiter plate and grown in 200 μl G₀ media at roomtemperature in high light in a box fed with 5% CO₂. Also included was apositive control that was highly resistant to NaCl, the original genedisruption strain as a control, and wildtype C. reinhardtii cc1690 (mt+)negative control. Once cultures were grown to saturation, 2 μl ofculture was pipetted onto solid G₀ media+75 mM NaCl, G₀ media+100 mMNaCl, G₀ media+125 mM NaCl (FIG. 15-38). In FIGS. 15-32, the wildtypenegative control is in position row 8 column 6, original gene disruptionstrain in position row 8 column 5, and the NaCl resistant positivecontrol in position row 8 column 3 and 4. In FIGS. 33-37, the wildtypenegative control is in position row 8 column 1, original gene disruptionstrain in position row 8 column 3, and the NaCl resistant positivecontrol in position row 8 column 5. For FIG. 38, the wildtype negativecontrol is in position row 8 column 6, original gene disruption strainin position row 8 column 4, and the NaCl resistant positive control inposition row 8 column 2. Plates were grown at room temperature in highlight in a box fed with 5% CO₂.

Random integration into the nuclear genome affects protein expression bypositional effect. This effect was also observed when expressingartificial miRNA. Validation of the gene target was indicated by thedistribution of salt gene-targeting artificial miRNA transformants thatare resistant to NaCl (FIGS. 15-38) were also compared to the resistanceof transformants of a random DNA fragment, for example, an artificialmiRNA targeting a non-salt target. The percentage of highly resistantstrains was a product of both the validity of the gene target and miRNAdesign. These results confirm that the genes represented by S7 (Augustusv.5 Protein ID: 523016), S16 (Protein ID: 195781), S65 (Augustus v.5Protein ID: 517886), S1659 (Protein ID: 178706), S77 (Augustus v.5Protein ID: 522165), S1666 (Augustus v.5 Protein ID: 514721), S1704(Protein ID: 77062), S105 (Augustus v.5 Protein ID: 524679), S1612(Protein ID: 103075), S1644 (Protein ID: 331285), S1693 (Protein ID:188114), S1687 (Protein ID: 291633), S129 (Augustus v.5 Protein ID:510051), 5123 (Augustus v.5 Protein ID: 519822), S289 (Augustus v.5Protein ID: 518128), S276 (Augustus v.5 Protein ID: 512487). S292(Augustus v.5 Protein ID: 524030), 5291 (Augustus v.5 Protein ID:516191), S294 (Augustus v.5 Protein ID: 522637), S338 (Augustus v.5Protein ID: 512361), S74 (Augustus v.5 Protein ID: 520845), S1613(Protein ID: 174261), S1621 (Protein ID: 206559), S1623 (Protein ID:116195), S1638 (Protein ID: 418706), S1655 (Augustus v.5 Protein ID:525078) confer NaCl resistance when disrupted by insertion and/orsilencing.

Phenotypes of some knockdown transformants (for strains S7 and S16) weretested on gradient salt: agar plates. 200 ml of G₀ agar and 200 ml of G₀agar+200 mM NaCl were made. One edge of a 9″×9″ bioassay tray was set ata 0.9 cm higher to create an angle. G₀ agar was poured first and left tosolidify at an angle. The plate was returned to a level position and 200ml of G₀ agar+200 mM NaCl was subsequently poured. Candidate strains S7,S16, along with their associated artificial miRNA transformation strainswere grown up in G₀ media. Approximately 1 mL of 2.5×10⁷ cells/mlculture was spread across a one-inch section of a 9″×9″ bioassay trayusing a sterile loop. Plated candidate strains were spread in the samedirection the gradient was poured. Wildtype C. reinhardtii was added tothe plate as well. In FIGS. 12, 13, and 14, the gene disruption strainsand the knockdown transformations all show a considerable increase insalt tolerance.

Example 4 QPCR

In this example, the transcript levels of 4 salt gene targets, namely S7(Protein ID: 143076, Augustus v.5 ID: 523016, SEQ ID NO: 115), S16(Protein ID: 192517, SEQ ID NO: 116), S77 (Augustus v.5 ID: 522165, SEQID NO: 129) and S338 (Augustus v.5 ID: 512361, SEQ ID NO: 175) and theirrelated artificial miRNA knockdown strains were examined by quantitativePCR and salt resistance. All DNA manipulations carried out in thisexample were essentially as described by Sambrook et al., MolecularCloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press 1989)and Cohen et al., Meth. Enzymol. 297, 192-208, 1998.

Further validation was performed on individual knockdown transformantsby quantitative PCR to correlate phenotype to transcript levels.Decreased transcript levels were observed with an increase in saltresistance thereby further demonstrating that the phenotype isgenetically linked to the gene disruption. For S7 and S16, the leadingedge of the miRNA-2 bands on the gradient plates in FIGS. 12, 13, and 14were taken and struck for single colonies. Five colonies were taken. ForS77 and S338, 6 random knockdown transformants were taken. Knockdownstrains were grown in 5 ml of G₀ media in high light in a box fed with5% CO₂. Algae biomass was resuspended in plant RNA reagent (Invitrogen)and RNA was extracted according to the manufacturer. Residual DNA wasremoved by using RNeasy spin-column cleanup (Qiagen) to ensure purifiedRNA according to the manufacturer. 500 ng of RNA was reverse transcribedusing iScript cDNA Synthesis Kit (Bio-Rad Laboratories) and theresulting cDNA was diluted ten-fold before PCR amplification.

Real time PCR was performed using Biorad's MyiQ2 Two-Color Real-Time PCRDetection System. Primers used in the qPCR analysis were designed andtested to ensure consistency. Reactions were performed in a 25 μl volumewith the 6 μl of 4 μM primer mix, 6 μl of diluted cDNA, and 12.5 μl ofiQ SYBR green super mix which contains dNTPs, iTaq polymerase, 6 mMMgCl2, SYBR green I, 20 nM fluorescein. The protocol was as follows: 1cycle [95° C. for 30 sec], 45 cycles [95° C. for 10 sec followed by 57°C. for 30 sec], and 77 cycles [extension at 57° C. for 10 sec]. Thequantification data were analyzed using the iQ5 software. Transcriptlevels are normalized and compared to wildtype using the transcriptlevels of a housekeeping gene. The qPCR results for S7, S16, S77, S338are shown in FIG. 10, 11, 39, and FIG. 40, respectively. As only algaethat were tolerant for salt were taken for S7 and S16, the transcriptlevels for all six samples were decreased significantly. For the casesof S77 and S338, the knockdown strains were all salt tolerant. Therewere subtle differences for both cases. In FIG. 39, strains S77-4,S77-5, and S77-6 had slightly higher transcript levels that correlatewith decreased salt tolerance whereas those with reduced transcriptlevels (S77-1, S77-2, and S77-3) correlate with increased salttolerance. In FIG. 40, strain S338-4 had a slightly higher transcriptlevel that correlates with decreased salt tolerance whereas those withreduced transcript levels (S338-1, S338-2, S338-3, S338-5, and S338-6)correlate with increased salt tolerance. Decreased transcript levels andsalt tolerance along with unchanged transcript levels and saltsensitivity further validate these gene targets as conferring saltresistance by knockout or knockdown.

TABLE 3 Adaptor Pairs Adaptor Ligation Method or Cassette PCR AdaptorAdaptor 1- 5′ SEQ ID GTAATACGACTCACTATAGAGTAC NO: 88GCGTGGTCGACGGCCCGGGCTGGT Adaptor 2- 3′ SEQ ID 5′ Phos-ACCAGCCCGG 3′  NO: 89 Amino Modifier Inverse Tandem Repeat(ITR)/Suppression PCR Adaptor Adaptor 3- 5′ SEQ IDCTAATACGACTCACTATAGGGC NO: 90 TCGAGCGGCCGCCCGGGCAGGT Adaptor 4- 3′SEQ ID ACCTGCCCGGGCGGCCGCTCGA NO: 91 GCCCTATAGTGAGTCGTATTAGAdaptor Primers Adaptor Ligation Method or Cassette PCR AdaptorAdaptor Primer SEQ ID GTAATACGACTCACTATAGAGT 1 NO: 92 Adaptor PrimerSEQ ID ACTATAGAGTACGC GTGGT 2 NO: 93 Inverse Tandem Repeat(ITR)/Suppression PCR Adaptor Adaptor Primer SEQ ID CTAATACGACTCACTATAGG3 NO: 94 Adaptor Primer SEQ ID ACTATAGGGCTCGAGCGGCC 4 NO: 95Hygromycin Cassette (SENuc 146) 3′ cassette- SEQ IDGACCAACATCTTCGTGGACCT specific NO: 96 GGCCGC primer 1 3′ cassette-SEQ ID GACCAACATCTTCGTGGACCT specific  NO: 97 primer 2 3′ nested  SEQ IDACTTCGAGGTGTTCGAGGAGA cassette- NO: 98 CCCCGC specific  primer 1 3′nested  SEQ ID CTGGTGCAACTGCATCTCAAC cassette- NO: 99 specific  primer 23′ nested  SEQ ID ACTTCGAGGTGTTCGAGGAGAC cassette- NO: 100 specific primer 3 5′ cassette- SEQ ID CTCGCCGAACAGCTTGAT specific  NO: 101primer 1 5′ cassette- SEQ ID GGCTCATCACCAGGTAGGG specific  NO: 102primer 2 5′ nested  SEQ ID CGAATCAATACGGTCGAGAAGT cassette- NO: 103AACAG specific  primer 1 5′ nested  SEQ ID CGAATCAATACGGTCGAGAAGTcassette- NO: 104 specific  primer 2 5′ nested  SEQ IDAACAGGGATTCTTGTGTCATGTT cassette- NO: 105 specific  primer 3Paromomycin Cassette (SENuc 140) 3′ cassette- SEQ ID CTGCTCGACCCTCGTACCTspecific  NO: 106 primer 1 3′ cassette- SEQ ID GACTTGGAGGATCTGGACGAGspecific  NO: 107 primer 2 3′ nested  SEQ ID CTGCTCGACCCTCGTACCTcassette- NO: 108 specific  primer 1 3′ nested  SEQ IDGAAAAGCTGGCGTTTTACCG cassette- NO: 109 specific  primer 2 5′ cassette-SEQ ID AGAGCTGCCACCTTGACAAA specific  NO: 110 CAACTC primer 1 5′cassette- SEQ ID CAACACGAGGTACGGGAATC specific  NO: 111 primer 2 5′nested  SEQ ID TCCTCCACAACAACCCACTCA cassette- NO: 112 CAACCG specific primer 1 5′ nested  SEQ ID GAGCTGCCACCTTGACAAAC cassette- NO: 113specific  primer 2 5′ nested  SEQ ID TCCTCCACAACAACCCACTC cassette-NO: 114 specific  primer 3 RSO Restriction Site Primer  SequencesAgeI Primer SEQ ID TAATACGACTCACTATAGGGNNN NO: 241 NNNNNNNACCGGTKpnI Primer SEQ ID TAATACGACTCACTATAGGGNNN NO: 242 NNNNNNNGGTACCArtificial miRNA Cloning Primers 5′ Arm Primer 1 SEQ IDGACTATTAATGGTGTTGGGTCGG NO: 243 TGTTTTTGGTC 5′ Arm Primer 2 SEQ IDAGATCTCAGCTGGAACACTGCGC NO: 244 CCAGG 3′ Arm Primer 3 SEQ IDGCAGTGTTCCAGCTGAGATCTAG NO: 245 CCGGAACACTGCCAGGAAG 3′ Arm Primer 4SEQ ID GACTGGATCCGGTGTAACTAAGC NO: 246 CAGCCCAAAC

RNA Blot Analyses

The transcript expression levels of the target gene in a transgenic cellline can be detected using an RNA blot technique. The RNA extraction andsmall RNA detection can be performed as described (for example, asdescribed in Molnar et al., Nature, 447:1126-1129 (2007)). A detailedprotocol can be found, for example, athttp://www.plantsci.cam.ac.uk/Baulcombe/pdfs/smallrna.pdf. Total RNA isisolated, separated in a 15% denaturing polyacrylamide gel, and blottedto Hybond N+ (GE Lifesciences, http://www.gelifesciences.com). DNAoligonucleotides complementing to the reverse complement of an amiRNAsequence are labeled with polynucleotide kinase (PNK) in the presence ofγ³²P-ATP and hybridized to the immobilized RNA. Decade RNA marker(Ambion, USA, http://www.ambion.com) labeled according to themanufacturer's instructions, is used as a size marker.

Example 5 Other Methods to Generate Salt Tolerant Strains by Knock Outand/or Knock Down

There are many useful approaches to generating salt tolerant strainsonce the sequence characterization of the gene disruption is known. Asmentioned in Example 3, the expression of an artificial miRNA led to adecrease in transcript levels. Other methods of RNA silencing involvethe use of a tandem inverted repeat system (Rohr et al., Plant J,40:611-621 (2004)) where a 100-500 bp region of the targeted genetranscript is expressed as an inverted repeat. The advantage ofsilencing is that there can be varying degrees in which the targettranscript is knocked down. Oftentimes, expression of the transcript isnecessary for the viability of the cell. Thus, there can exist anintermediate level of expression that allows for both viability and alsothe desired phenotype (e.g. salt tolerance). Finding the specific levelof expression that is necessary to produce the phenotype is possiblethrough silencing.

Homologous recombination can be carried out by a number of methods andhas been demonstrated in green algae (Zorin et al., Gene, 423:91-96(2009); Mages et al., Protist 158:435-446 (2007)). A knock out can beobtained through homologous recombination where the gene product (e.g.mRNA transcript) is eliminated by gene deletion or an insertion ofexogenous DNA that disrupts the gene.

Gene Deletion

One such way is to PCR amplify two non-contiguous regions (from severalhundred DNA base pairs to several thousand DNA base pairs) of the gene.These two non-contiguous regions are referred to as Homology Region 1and Homology Region 2 are cloned into a plasmid. The plasmid can then beused to transform the host organism to create a knockout,

Gene Insertion

Another way is to PCR amplify two contiguous or two non-contiguousregions (from several hundred DNA base pairs to several thousand DNAbase pairs) of the gene. A third sequence is ligated between the firstand second regions, and the resulting construct is cloned into aplasmid. The plasmid can then be used to transform the host organism tocreate a knockout. The third sequence can be, for example, an antibioticselectable marker cassette, an auxotrophic marker cassette, a proteinexpression cassette, or multiple cassettes.

While certain embodiments have been shown and described herein, it willbe obvious to those skilled in the art that such embodiments areprovided by way of example only. Numerous variations, changes, andsubstitutions will now occur to those skilled in the art withoutdeparting from the disclosure. It should be understood that variousalternatives to the embodiments of the disclosure described herein maybe employed in practicing the disclosure. It is intended that thefollowing claims define the scope of the disclosure and that methods andstructures within the scope of these claims and their equivalents becovered thereby.

What is claimed is:
 1. A genetically modified non-vascularphotosynthetic organism comprising at least one RNAi agent, said atleast one RNAi agent comprising an antisense nucleotide sequence that iscomplementary to the mRNA transcribed from SEQ ID NO. 115 or a sequencehaving at least 95% identity thereto; wherein said non-vascularphotosynthetic organism has an increased growth rate in an aqueousenvironment containing between about 75 mM and 275 mM sodium chloride ascompared to a genetically unmodified non-vascular photosyntheticorganism of the same species lacking said at least one RNAi agent; andwherein said photosynthetic organism is at least one of Chlamvdomonassp, Volvacales sp, Dunaliella sp, Scenedesmus sp, Chloralla sp,Hematococcus sp., Volvox sp, or Nannochloropsis sp.
 2. The geneticallymodified organism of claim 1, wherein said at least one RNAi agent is amicroRNA (miRNA).
 3. The genetically modified organism of claim 1,wherein said at least one RNAi agent is a small interfering RNA (siRNA).4. The genetically modified organism of any one of claims 1-3, whereinactivity of a protein encoded by SEQ ID NO. 115 or a sequence having atleast 95% sequence identity to said protein is decreased as compared tothe activity of said protein in a genetically unmodified non-vascularphotosynthetic organism of the same species lacking said at least oneRNAi agent; and wherein said photosynthetic organism is at least one ofChlamvdomonas sp, Volvacales sp, Dunaliella sp, Scenedesmus sp,Chloralla sp, Hematococcus sp., Volvox sp, or Nannochloropsis sp.
 5. Thegenetically modified organism of claim 4, wherein said activity isdecreased by at least 10%.
 6. The genetically modified organism of claim4, wherein said activity is decreased by at least 20%.
 7. Thegenetically modified organism of claim 4, wherein said activity isdecreased by at least 30%.
 8. The genetically modified organism of claim4, wherein said activity is decreased by at least 40%.
 9. Thegenetically modified organism of claim 4, wherein said activity isdecreased by at least 50%.
 10. The genetically modified organism ofclaim 4, wherein said activity is decreased by at least 60%.
 11. Thegenetically modified organism of claim 4, wherein said activity isdecreased by at least 70%.
 12. The genetically modified organism ofclaim 4, wherein said activity is decreased by at least 80%.
 13. Thegenetically modified organism of claim 4, wherein said activity isdecreased by at least 90%.
 14. The genetically modified organism ofclaim 4, wherein said activity is decreased by 100%.
 15. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 10% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 16. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 20% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 17. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 40% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 18. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 60% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 19. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 80% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 20. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 100% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 21. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 150% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 22. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 200% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 23. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 250% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 24. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 300% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 25. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 350% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 26. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 400% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 27. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 450% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 28. The geneticallymodified organism of any one of claims 1-3, wherein the growth rate ofsaid genetically modified organism is at least 500% greater than saidgenetically unmodified non-vascular photosynthetic organism of the samespecies lacking said at least one RNAi agent.
 29. The geneticallymodified organism of any one of claims 1-3, wherein said transcribedmRNA is decreased as compared to said genetically unmodifiednon-vascular photosynthetic organism of the same species lacking said atleast one RNAi agent.
 30. The genetically modified organism of claim 29,wherein said transcribed mRNA is decreased by at least 10%.
 31. Thegenetically modified organism of claim 29, wherein said transcribed mRNAis decreased by at least 20%.
 32. The genetically modified organism ofclaim 29, wherein said transcribed mRNA is decreased by at least 30%.33. The genetically modified organism of claim 29, wherein saidtranscribed mRNA is decreased by at least 40%.
 34. The geneticallymodified organism of claim 29, wherein said transcribed mRNA isdecreased by at least 50%.
 35. The genetically modified organism ofclaim 29, wherein said transcribed mRNA is decreased by at least 60%.36. The genetically modified organism of claim 29, wherein saidtranscribed mRNA is decreased by at least 70%.
 37. The geneticallymodified organism of claim 29, wherein said transcribed mRNA isdecreased by at least 80%.
 38. The genetically modified organism ofclaim 29, wherein said transcribed mRNA is decreased by at least 90%.39. The genetically modified organism of claim 29, wherein saidtranscribed mRNA is decreased by 100%.
 40. The genetically modifiedorganism of any one of claims 1-3, wherein a protein encoded by saidtranscribed mRNA is decreased as compared to the protein encoded by saidtranscribed mRNA in said genetically unmodified non-vascularphotosynthetic organism lacking said at least one RNAi agent.
 41. Thegenetically modified organism of claim 40, wherein said protein isdecreased by at least 10%.
 42. The genetically modified organism ofclaim 40, wherein said protein is decreased by at least 20%.
 43. Thegenetically modified organism of claim 40, wherein said protein isdecreased by at least 30%.
 44. The genetically modified organism ofclaim 40, wherein said protein is decreased by at least 40%.
 45. Thegenetically modified organism of claim 40, wherein said protein isdecreased by at least 50%.
 46. The genetically modified organism ofclaim 40, wherein said protein is decreased by at least 60%.
 47. Thegenetically modified organism of claim 40, wherein said protein isdecreased by at least 70%.
 48. The genetically modified organism ofclaim 40, wherein said protein is decreased by at least 80%.
 49. Thegenetically modified organism of claim 40, wherein said protein isdecreased by at least 90%.
 50. The genetically modified organism ofclaim 40, wherein said protein is decreased by 100%.
 51. The geneticallymodified organism of any one of claims 1-3, wherein said sodium chloridein said aqueous environment is between 100 mM and 250 mM.
 52. Thegenetically modified organism of any one of claims 1-3, wherein saidsodium chloride in said aqueous environment is between 100 mM and 200mM.
 53. The genetically modified organism of any one of claims 1-3,wherein said sodium chloride in said aqueous environment is between 75mM and 175 mM.
 54. The genetically modified organism of any one ofclaims 1-3, wherein said sodium chloride in said aqueous environment isbetween 75 mM and 150 mM.
 55. The genetically modified organism of anyone of claims 1-3, wherein said sodium chloride in said aqueousenvironment is between 100 mM and 275 mM.
 56. The genetically modifiedorganism of any one of claims 1-3 wherein said sodium chloride in saidaqueous environment is between 150 mM and 275 mM.
 57. The geneticallymodified organism of any one of claims 1-3, wherein said sodium chloridein said aqueous environment is between 200 mM and 275 mM.